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INTRODUCTION 


Conventions 


Decimal numbers are shown either without a suffix or with a suffix of ‘d’ e.g. 256d. 

Hexadecimal numbers are shown with either a prefix of ‘Ox’ or a suffix of ‘h’ e.g. Ox1BO or 1BOh. 
Binary numbers are shown with a suffix of ‘b’ e.g. 001101b. 

Registers are all shown as follows: 


RO Read Only 
RW Read Write 
WO Write Only 


‘Reserved’ means that the register bits are reserved and should be set to 0. 


Addresses and Address Bits are always described as byte addresses, though many addresses in the 
system are aligned on 32bit or other boundaries, and as such do not require the low bits of the byte 
address. 


References 


[1] PowerVR 250 Data sheet 

[2] PCI 2.1 specification 

[3] AGP 1.0 specification 

[4] VMI 1.4 specification 

[5] VGA reference 

[6] Bump mapping document 

[7] VQ document 

[8] Primary Core Interface for the Tile Accelerator 
[9] Primary Core Interface for 2D operations. 
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POWERVR 250 OVERVIEW 


PowerVR 250 is a single chip 2D/3D accelerator which provides high end 3D performance PowerVR 
Second Generation technology, full Microsoft Windows™ 2D and Video acceleration and a digital port 
to an LCD interface chip or TV encoder. 


Video Data — 
=~ Video In Pixels 
Primary Core 
RGB 


Cache Memory Subsystem VGA 


Table 1. PowerVR 250 Block Diagram 
Host Interface 


The PowerVR 250 host interface supports AGP v1.0 and PCI v2.1 bus standards. The AGP interface 
is AGP 2x with side band addressing for peak bus bandwidths of 532MB/s. PowerVR 250 also 
supports a 32 deep request queue and address remapping for GART support. The PCI interface is a 
PCI-33 compliant interface with write combining support. 


PowerVR 250 performs most of it’s operations as a bus master for most efficient use of the bus. 
SDRAM Interface 
The PowerVR 250 memory interface is a 64-bit wide interface which supports 64Mbit SDRAM, 16Mbit 


SDRAM, and 16Mbit SGRAM devices. This provides a peak data rate of 800MB/s and support for 
8MB, 16MB or 32MB of memory. 


Primary Core 


The primary core is responsible for processing requests from the host and initiating 2D operations as 
well as handling other data input and output from the DAC, TV and Video In feeds. It is a microcoded 
engine which executes up to three separate threads of microcode. The background thread (thread 0) 
is responsible for host communication and control of the 2D core and two real time threads which are 
responsible for one of the DAC Feed and Video In data feeds respectively. 
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2D Core 

The 2D core performs 2D pixel operations including BLTs, color space conversion, motion 
compensation, 2D blend operations and ROPs. 

3D Core (ISP, TSP) 

The 3D core is the 3D rendering engine in PowerVR 250. It takes polygon parameters and performs 
full setup, hidden surface removal, shading, texturing and blending. 

Tile Accelerator 

The tile accelerator performs the first step in the 3D rendering pipeline for PowerVR, pre-processing a 
scene of 3D data to creating a tiled representation of it which can be read directly by the 3D core. 
DAC Feed 


Controlled by one of the primary core real time threads, which reads a display list describing the 
format of the display and feeds it into the DAC feed. The DAC feed fetches the resulting pixel data for 
the CRT display or output through the digital port to an LCD interface or TV encoder. 


Video In 

Also controlled by a real time thread, the video in feed provides a VMI 1.4 video data connection 
between a video decoder or MPEG decoder and PowerVR 250. The resulting data is written to either 
framebuffer or host by the Primary core. This gives the ability to perform video capture and display live 


video windows as well as performing VBI data capture. The VMI 1.4 host data is sent over the GPIO 
port. 


Cache Subsystem 

The cache memory subsystem is used by all other elements requiring access to memory. It provides 
read and write cacheing and arbitration to the host bus or the framebuffer. 

ROM / GPIO Interface 

The ROM / GPIO interface is an 8-bit data interface used for ROM IO, VMI host data and as a general 
purpose IO bus. 

VGA 


The PowerVR 250 provides complete functional and register compatibility with the IBM® VGA. 
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REGISTER OVERVIEW 


PowerVR 250 has a number of programmable registers which fall into different classes: 


Configuration Space Registers 


These are PCI registers accessible using PC] CONFIG cycles and are generally as specified in PCI 
related standards. A number of these registers are also accessible from the Primary Core (they are 
registers aliased as Implicit Registers). 


VGA registers 


These are l|O mapped registers (with a memory mapped alias) which are a complete set of IBM® VGA 
compatible registers. 


Immediate Registers 


These are memory mapped and IO mapped registers which are accessible from the host with a single 
atomic read or write. They are accessed by adding the register offset to the relevant base address 
register value (stored in configuration registers) to form the absolute register address. They are 
identified by being documented with an Offset address. 


Implicit Registers 


These 16-bit registers are written via the Write Access Register (WAR) and read via one of two Read 
Access Registers (RARs). They are identified by being documented with an Implicit address. The 
WAR is an immediate registers with an offset of OxO0OCO. The RARs are also immediate registers with 
offsets of and Ox00C4 and Ox00C8 respectively. The WAR has a format shown below: 


Write Access Register 


Offset 0x00CO 
Reset Value 0x00000000 


31-30 29-26 25-16 15-0 
IA_RW IA_ROUTE IMPLICIT_ADDRESS IA_ DATA 


Read Access Register 1 


Offset 0x00C8 
Reset Value 0x00000000 


31-30 
IA_RW IA_ROUTE IMPLICIT_ADDRESS IA_LRTNDATA 


Read Access Register 2 


Offset 0x00C4 
Reset Value 0x00000000 


31-30 | 29-25 15-0 
IA_RW IA_LROUTE IMPLICIT_ADDRESS |A_RTNDATA 
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IA_RW (RW) 


00 Host Read 
01 Host Write 
10 Reserved 
11 Reserved 


IA_ROUTE (RW) 


Determines the routing of the register. A value of 0001b is used for writes and reads which return data 
in Read Access Register 1. A value of 1001b is used for reads which return data in Read Access 
Register 2. 


IMPLICIT_ADDRESS (RW) 


Register address. 


IA_DATA (RW) 


16-bits of write data 


IA_RTNDATA (RW) 


16-bits for data returned from the register. 


To Write an implicit register, set IA_RW to write with the required IMPLICIT_ADDRESS and 
IA_DATA. 


The Write Access Register (WAR) and Read Access Registers (RARs) are used as described in the 
introduction to this section. 


Reading an implicit register is a 2 step process. First write to the WAR with IA_RW set to read, and 
with the required IMPLICIT_ADDRESS; IA_DATA is don’t care. Secondly, read from the required 
RAR (as determined by the IA_ROUTE bits), which have the same format as the WAR, and data is 
returned in the low 16 bits. 


NOTE: A read to a RAR always completes, even though the data may not have yet have returned 
from the requested register. In order to ensure that correct data is returned, it is necessary to check 
that the address field returned in the RAR matches that written to the WAR. 


RAR?1 is typically used for reads from the host by normal processes, leaving RAR2 as a read register 
for interrupt processes. 


Indexed Registers 


Indexed registers are 16 or 32 bit registers which are also accessed through the WAR and RAR, albeit 
in an indirect manner, by using the Implicit Registers Index Address and Index Data. They are 
identified by being documented with an Index address. 


Index Address 


Implicit Address 0x0104 
Reset Value 0x0000 


9-1 
IDX_PKT IDX_INC IDX_ADDR 
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Index Data 
Implicit Address 0x0105 
Reset Value 0x0000 


IDX_DATA 


IDX_PKT (RW) 


When writing to an index register (IA_RW set to write) IDX_PKT determines how many data packets 


to send (if the index register is a 16-bit register, then this must be set to 0). 


0 Write 1 packet to 16 LSBs of Index Register 
1 Write 2 packets to 16 LSBs and 16 MSBs 


When reading from an index register, IDX_PKT determines whether to read the LSB of the MSB of the 


index register (if the index register is a 16-bit register, then this must be set to 0). 


0 Read 16 LSBs 
1 Read 16 MSBs 
IDX_INC (RW) 


Auto increment 


1 Successive writes of data use incrementing index addresses. 


IDX_ADDR (RW) 


The index address of the desired register. 


IDX_DATA (RW) 


The data to write to / read from the index register. 


Writing to an Indexed register involves the following steps: 
Write the Register Index to the Index Address Register (via the WAR) 
Write Register Data (16 bit LSBs) to the Index Data Register (via the WAR) 
If IDX_PKT is 1, Write Register Data (16 bit MSBs) 
If IDX_INC is 1, Optionally write to further registers 


To Read from an Indexed register: 
Write the Register Index to the Index Address Register (via the WAR) 
Read data from the Index Data Register (via the RAR). 
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These are the configuration space registers for PowerVR 250 . Access to these is as defined in [2]. 


Vendor ID (RO) 


ae Ma 
Type Timer Line Size 

fd 

pe 


Expansion ROM base address 


AGP Status 


ee ee 
J ie 
ss 

Leer ey 


AGP PLL Control CORE PLL Control 


Table 2. PCI Configuration Space 


PowerVR 250 has a Vendor ID of 1033h. 


Device ID (RO) 


PowerVR 250 has a Device ID of 0067h. 
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00h 
04h 
08h 
OCh 


10h 
14h 
18h 
1Ch 
20h 
24h 
28h 
2Ch 
30h 
34h 
38h 
3Ch 
40h 
44h 
48h 
4Ch 
50h 
54h 
58h 
5Ch 
60h 
64h 
68h 
6Ch 
70h 
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Command 


15-10 Ls _|st2ztetstojstzt{its | 


Reserved | Fast Back| SERR# Wait Parity Palette Special Master Mem fe) 
to Back Cycle Error snoop Cycle Decode | Decode 


Fast Back To Back (RW) 


When set, enables fast back to back transactions on the bus. 
SERR# Enable (RO) 

Not supported. Returns 0 

Wait Cycle Control (RO) 

Not supported. Returns 0 

Parity Error Response (RO) 

Not supported. Returns 0 

Palette Snoop (RW) 

VGA Palette snoop enable. 

MWI (RW) 


Memory Write Invalidate enable. When set, PowerVR 250 uses MWI when performing PCI writes 
which fill a cache line. 


Special Cycles (RO) 

Not supported. Returns 0 

Master (RW) 

When set, PowerVR 250 can perform bus master cycles. 
Mem Decode (RW) 


When set, PowerVR 250 decodes bus memory cycles. 


10 Decode (RW) 
When set, PowerVR 250 decodes bus IO cycles. 
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0 | oo Fo ft fof w@ |i | Resemes 


Bit 
13 Received Master Abort Reset to 0 on a write 
12 Received Target Abort Reset to 0 on a write 
U1 Signalled Target Abort Reset to 0 on a write 
10-9 DevSel timing Medium 
7 Fast back to back capable 
5 66MHz Capable This is set when AGP operation is enabled and 
reset otherwise (power on strap) 
4 New capabilities supported 


Revision (RO) 


PowerVR 250 has a revision of 02h. 
Class Code (RO) 


PowerVR 250 is a VGA compatible display device with class code of 030000h 
Cache Line Size (RW) 


Specifies the system cache line size in units of 32-bit words. 
Latency Timer (RW) 


Specifies Latency for PowerVR 250. (bits 0 — 2 are always 0). 
Header Type (RO) 


This is a Type 0 header and returns 00h 
BIST (RO) 


BIST is not supported — this register returns 0 


Memory Space 0 — Frame Buffer (RW) |: 


~ 64MB pre-fetchable memory map for PowerVR 250 framebuffer. 


Memory Space 1 —- Memory Mapped Registers (RW) 


16KB pre-fetchable memory map for PowerVR 250 immediate registers. 
Memory Space 2 — VGA Frame Buffer (RW) 


» 128KB pre-fetchable memory map to allow a copy of the VGA memory space to be directly mapped. 


Memory Space 3 - Memory Mapped VGA Registers (RW) 


y 2KB non pre-fetchable memory map to allow a copy of VGA registers to be mapped. It maps IO 


Registers at addresses starting from 3COh i.e. VGA register 3COh is at offset 00h and VGA register 
3D4h is at offset 14h. 
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Memory Space 4 — 1O Mapped Registers (RW) 


256 byte 1O map for PowerVR 250 Immediate registers * 

It’s main purpose is to allow immediate registers to be programmed in real mode DOS. 

Subsystem Vendor ID (RO) 

The Subsystem Vendor ID a Read Only register which identifies the vendor of the board. It may be set 


at power up from a value stored in the adapter ROM or from a value written into the SSVIDW 
configuration register. 


Subsystem Device ID (RO) 
The Subsystem Device ID a Read Only register which identifies the board. It may be set at power up 


from a value stored in the adapter ROM or from a value written into the SSDEVW configuration 
register. 


Expansion ROM Base Address (RW) 


128KB memory map for adapter ROM. 
Caps Ptr (RO) 


Pointer to linked list of new capabilities at OO60h 
Max_Lat (RO) 


PowerVR 250 has no requirements for the setting of latency timers and this register is set to 0. 
Min_Gnt (RO) 
PowerVR 250 has no requirements for the setting of latency timers and this register is set to 0. 


Int Pin (RO) 


PowerVR 250 supports use of the INTA# pin. 
Int Line (RW) 


Interrupt routing information register. 
AGP Capabilities Block 


This block of registers is only active when PowerVR 250 powers up with AGP support enabled (power 
on resistor strap) otherwise all the AGP registers are read only and return 0. 


AGP CapID (RO) 


The AGP capabilities ID: 02h 
AGP NextPtr (RO) 


Null Pointer to next capabilities block returns OOh. 
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AGP Revision (RO) 
15-8 
0 Major 
Bit 
7-4 AGP Major Oth 


3-0 AGP Minor 00h 
AGP Status (RO) 


31 —24 23-10 i ee ee 
Ra | Reseed | spa] Reserved | Rate 


Bit 
31-24 RQ Number of command requests PowerVR 250 can 
handle. Returns 31. 
9 SBA Side band addressing support. Returns 1 
1-0 Rate Transfer Rates supported. Returns 011b (AGP 1x 


and 2x) 
AGP Command (RW) 


23 — 10 


91-24 ee ee 
RQ Depth Reserved SBA Enable | AGP Enable 


Bit 
31-24 RQ_Depth Number of requests master is allowed to enqueue 
9 SBA_Enable Enable side band addressing 
8 AGP_Enable Enable AGP transactions 
1-0 Data_Rate Enable Transfer Rate. 


Setting bit 0 enables 1x mode. 
Setting bit 1 enables 2x mode. 


Discard Timer (RW) 


Discard Timeout 


This register is an internal performance tweak register. It defaults to 7FFFh on reset. 
16T Timer (RW) 


7-0 
16T Timeout 


This register is an internal performance tweak register. It defaults to OFh on reset. It determines the 
number of PCI cycles that the AGP interface will wait during a slave read before asserting STOP on 
the PCI bus. 
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8T Timer (RW) 


7-0 
8T Timeout 


This register is an internal performance tweak register. It defaults to 07h on reset. It determines the 
number of PCI cycles that the AGP interface will wait during a slave write before asserting STOP on 
the PCI bus. 


Subsys Control 


16=2 re ee 
Reserved SUBSYS_WE | SUBSYS_SEL 


This register controls which values are reported as SubSystem Vendor and Device ID. If 
SUBSYS_SEL is set then the values written to SSVIDW and SSDEVW are used, otherwise the 
internal register values (as initialised by the primary core) are used. 


SUBSYS_WE enables writing to the SUBSYS_SEL bit if set, otherwise the bit is write only. 
SSVIDW 


15-0 
SSVIDW 


When SUBSYS_SEL is set, this register is used to provide the value read from SubSystem Vendor ID. 
SSDEVW 


15-0 
SSDEVW 


When SUBSYS_SEL is set, this register is used to provide the value read from SubSystem Device ID. 
Therefore in order to program SubSystem Vendor ID and SubSystem Device ID from the host: 


Write values to SSVIDW and SSDEVW 
Set SUBSYS_WE bit 
Set SUBSYS_SEL bit 


Power Management Capabilities Block 
PM CapID (RO) 


The Power Management capabilities ID - 01h. 
PM NextPtr (RO) 


Pointer to next capabilities block. 


If AGP operation is enabled it is set to 40h, otherwise it is set to OOh. 
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15-11 


Read only capabilities register 


Bit 
15-11 PME# support PowerVR 250 does not support PME# 
10 D2 support PowerVR 250 supports D2 
9 D1 support PowerVR 250 supports D1 
8-6 Aux Current N/A 
5 DSI Device Specific !nitialisation required 
3 PME Clock N/A 


2-0 PCI PM 1.0 compliant 


PM CSR (RW) 


14=13 12-9 pe | ee to 
po | oo | 0 po | Reserved | Pw state 


The Power Management Control/Status Register is used to manage the PCI function’s power 
management state as well as to enable/monitor power management events. 


Bit 
15 PME Status Not used — return 0 
14-13 Data Scale Not used — return 0 
12-9 Data Select Not used — RW bits 
8 PME_EN Not used — RW bits 
1-0 PM_State Determine / set the current power state: 
00b — DO 
01b-D1 
10b — D2 
1 1b — D3nor 
PM BSE (RO) 


Power Management Bridge Extensions are not supported by PowerVR 250. This register returns OOh. 
PM Data (RO) 


Power Management Data is not supported by PowerVR 250. This registers returns 0. 
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AGP PLL Control (RW) 


Reset Value 0x0000 /) - 


15-3 1 Po 
Reserved AGP_PLLMask AGP_PLLStandby 


AGP_PLLMask (RW) 

Masks the clock during PLL's lockup time. When "0", this bit masks the output clock of the AGP PLL. 
When "1", this bit does not mask the output clock. 

After reset, and before AGP use, this bit must be set to ‘1’. 

AGP_PLLStandby (RW) 

This bit controls the standby mode of the AGP PLL. When this value is "0", the PLL operates normally. 
When "1" the PLL enters standby mode. 

CORE PLL Control (RW) | rqigivos PI 


fe 


14 13-8 7-0 
SPAM_RST | CTRL_CLK CTRL_ADDR CTRL_DATA 


SPAM_RST (RW) 


This bit resets the SPAM module which contains all the PowerVR 250 Immediate registers. Leave at 0. 
CTRL_CLK (RW) 


This and the two following registers allow the core clock PLL to be programmed from the host without 
relying on internal clocks. This procedure is normally performed at reset time only (by the BIOS) and 
only takes affect when the chip is reset using the software reset register. 


Programming is achieved by programming the CTRL_ADDR with an address and CTRL_DATA with 
data and manually clocking it by setting the CTRL_CLK to ‘1’ and then to ‘0’. 


CTRL_ADDR (RW) 


Address 

0x00 Reserved 
0x01 Core PLL 0 
0x02 Core PLL 1 
0x03 Core PLL 2 
0x04 Reserved 


CTRL_DATA (RW) 


The format for CTRL_DATA is dependant upon CTRL_ADDR. Core PLL programming is described in 
the clocking section below. 
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CLOCKING 


PowerVR 250 has three on chip PLLs. One is a phase aligner for AGP timing and is controlled by the 
AGP PLL Control configuration register. The second is the Core PLL, which is programmed via the 
CORE PLL Control configuration register. The last is the DAC PLL controlling the output pixel rate, 
which is programmed as described below. 


DAC PLL 0 
Offset 0x0080 Implicit Address 0x0120 
Reset Value 0x0021 


14-12 
DACPLL_SO DACPLL_P DACPLL_N DACPLL_M 


This register is one of two that controls the setup of the DAC clock PLL. Writing to this register does 
not take effect until the DAC PLL 1 register is written to. 


DAC PLL 1 
Offset 0x0084 Implicit Address 0x0121 
Reset Value 0x0000 


15-2 ee re 
Reserved DACPLL_MB DACPLL_S1 


This register is the second which controls the setup of the DAC clock PLL. Writes to this register 
cause all 17 bits of PLL control word to be written. 


The output frequency of the PLL is defined as 
Fout = Fref *N/(M * P) 
Where Fref is the oscillator input (typically 14.31818MHz). 
DACPLL_M (RW) 
In the equation above M = (DACPLL_M + 1). 
Valid values for DACPLL_M are 00001b to 11111b. 
DACPLL_N (RW) 
In the equation above N = (DACPLL_N + 1). 
Valid values for DACPLL_N are 0000001b to 1111111b. 
DACPLL_P (RW) 
DACPLL_P controls the output divider P as follows: 


O000b P=1 
OOib P=2 
O11b P=4 
111b P=8 
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SO and S1 are chosen to match the frequency the VCO in the PLL is operating at and ensures 
minimum jitter. The VCO frequency is: 


Fvco = Fref* N/M 


$1 So Fvco range 
0) 0 50 — 110MHz 
0 1 110 — 170MHz 
1 0 170 — 240MHz 
1 1 240 — 310MHz 


DACPLL_MB (RW) 


1 = Power Down PLL, 0 = normal operation 


Core PLL 0 


Offset 
Reset Value 


N/A 
0x21 


7-5 


COREPLL_N_LO 


Core PLL 1 

Offset 

Reset Value 
7 


COREPLL_SO 


Core PLL 2 


Offset 
Reset Value 


N/A 
0x00 


6-4 
COREPLL_P 


Reserved 


COREPLL_M 


COREPLL_N_HI 


a ee 
COREPLL_MB COREPLL_S1 


Core PLL register fields have the same meaning as DAC PLL fields described above. They are 
programmed by using the Core PLL Control register in the configuration register block. 
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INTERRUPTS & RESET 

PowerVR 250 supports a number of internal interrupt sources which are controlled through the 
interrupt mask and interrupt status register. 

Interrupt Mask 


Offset 0x0074 Implicit Address 0x014D 
Reset Value OxFFFF 


4 | 2 | 2 | wv | wo | » | 6s | 
MASTER VGA IICB_START | IICA_START C1_IRQ3 C1_IRQ2 C1_IRQ1 C1_IRQO 


pe | 6s | « | s | 2 [| + JT oo | 
PVR_IRQ PCI_LABORT | VID_OFLOW VID_EOF DAC_URUN DAC_VS Reserved (1) DAC_LINE 


This register masks interrupts. Writing a ‘1’ to a bit field masks the relevant interrupt. All the bit fields 
correspond exactly to those in the Interrupt Status register. 


All interrupts may be masked by setting the MASTER mask bit. 


Interrupt Status 


Offset 0x0070 Implicit Address 0x014C 
Reset Value 0x0000 


4 | iw | 2 | mw | wo | 2 | is | 
MASTER VGA IICB_START | IICA_START C1_IRQ3 C1_IRQ2 C1i_IRQ1 C1_IRQO 


a ee ee 


PVR_IRQ 


PCI_ABORT 


VIDIN_ VIDIN_EOF DAC_URUN DAC_VS Reserved DAC_LINE 
OFLOW 


The interrupt status register indicates the source of any interrupt caused by PowerVR 250 . These 
interrupts only result in an external (PCI) interrupt if the relevant mask bit in the interrupt mask register 
is zero. All the bits in the register are latched, that is they remain set even when the condition causing 
them has ended. Reading this register does not change the state of the bits and the bits are reset by 
writing a ‘1’ to the required bit to be reset. 


DAC_LINE 


Generated when the DAC scanline reaches the value set in the DAC line compare register (see 
below). 


DAC_VS 

Generated at the start of the DAC vertical sync period. 
DAC_URUN 

Generated if the DAC is started of data. 


VIDIN_EOF 


Generated when the video capture data reaches either VSync or the end of the capture region 
(determined by a CPT instruction to the video in core). 
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VIDIN_OFLOW 


Generated if an overflow occurs in the video capture FIFO. 


PCl_ABORT 
Caused by a PCI_LABORT condition on the bus. 


L PVR_IRQ 


Generated by any interrupting condition from the 3D pipeline. « 
C1_IRQO, C1_IRQ1, C1_IRQ2, C1_IRQ3 

Generated by writes to the Primary Core Soft Interrupt register 
lICA_START, IICB_START 


Caused by a start condition on IIC bus A or B 


VGA 


Generated by an interrupting condition from the VGA. This is the only interrupt input which is not 
latched (which allows it to be reset through VGA registers). This is the only method for the VGA core 
to interrupt the host. 


MASTER 


This is an OR of all the other interrupt inputs to the register. It is not latched, i.e. it becomes inactive as 
soon as no other interrupting conditions exist. 


DAC Line Compare 


Offset 0x0060 Implicit Address 0x0128 
Reset Value 0x0000 


14 13-12 
DAC_VP DAC_HP DAC_LINE_COMP 


DAC_LINE_COMP (RW) 


Causes an interrupt condition when the current DAC scanline is equal to the value programmed. Lines 
are counted from the leading edge of VSYNC. 


DAC_HP (RW) 
Sets the Horizontal sync polarity - 0 = active HI, 1 = active LO 
DAC_VP (RW) 


Sets the Vertical sync polarity - 0 = active HI, 1 = active LO 
DAC Line Count 


Offset 0x0064 Implicit Address 0x0129 
Reset Value 0x0000 


15-12 
Reserved DAC_LINE 
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Returns the current scanline on the DAC output. Reset on VSYNC 


Software Reset 


Offset 0x00A4 Implicit Address 0x0169 
Reset Value Ox7FFF 


14 13-12 po | om | 6 | 6s | 
SD_DIS REG_RST Reserved (1) Reserved (1) | _TA_RST ROM_RST | VGA_RST 


se | 5s | « | 5s | 2 | +1 | o | 
SD_RST | CACHE_RST | VIDIN_RST | Reserved(1) | C2_RST C1_RST | Reserved (1) | DAC_RST 


This register allows modules of PowerVR 250 to be reset. In practise these bits should only be used 
together in a group and modules should not be individually reset. 


All resets are active low. Write a 0 to reset the respective module and write a 1 to take that module out 
of reset. 


DAC_RST (RW) 
DAC-Core reset 
C1_RST (RW) 

Primary Core Reset 
C2_RST (RW) 
Secondary Core Reset 
VIDIN_RST (RW) 
Video-In Reset 
CACHE_RST (RW) 
Memory subsystem Reset 
SD_RST (RW) 

SDRAM Interface Reset 
VGA_RST (RW) 
VGA-Core Reset 
ROM_RST (RW) 

ROM Interface Reset 
TA_RST (RW) 

Tile Accelerator Reset 
REG_RST (RW) 


Resets Register interface. This also has the side effect of resetting the PVR block of logic. 
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SD_DIS (RW) 


This is not a reset bit. Setting it to 1 to tri-states the SDRAM interface signals, which would allow the 
bus to be used by another agent. Normally set to 0. 
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ARBITRATION SUBSYSTEM 


The arbitration subsystem in PowerVR 250 is responsible for arbitrating between modules requiring 
access to either the SDRAM or the host interface. Modules requests go through selectors which 
determine whether the request is for the host interface or the framebuffer. The arbiter prioritises 
accesses to the respective interface. 


| 


SDRAM 


Arbiter 


— | | 
ee | 


AGP 


Table 3. Arbitration Subsystem 


Arbitration priority is as shown below (in decreasing priority). 


DAC Feed Reads 


Primary Core Reads (Video In thread) 


Primary Core Reads (Background thread) 


Primary Core Reads (DAC feed thread) 


Primary Core Instruction Reads 8 


Primary Core Data Cache Writes 
Primary Core Write Sorter Writes 


3D Texture Parameter Reads 


3D Region Reads 


3D Pointer Reads 


3D ISP Parameter Reads 


Secondary Core Reads 


Tile Accelerator Control Data Reads 


Secondary Core Writes 


Tile Accelerator Writes 


Tile Accelerator Object Data Reads 


Host Reads 


3D Texture Reads 


Host Writes 


3D Render Writes 
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Address Remapping 


The arbiter performs a number of address translation steps to route accesses either to the framebuffer 
or the host interface as well as to automatically remap some accesses from the framebuffer to the host 
in order to support automatic overflow of data into host memory. 


Frame Buffer Offset 


Implicit Address 0x0287 
Reset Value 0x0000 

— 
FB_OFF (RW) 


The 6-bits specified for the Frame Buffer offset are the top 6-bits of a 32-bit byte address. This allows 
the 64MB frame buffer to be located anywhere in the 4GB address space. If the top 6-bits of the 
address of any request to the cache subsystem do not equal FB_OFF then the request goes to Host 
interface. All other accesses then go to the framebuffer, except for accesses which are further 
checked for GART redirection as described below. 


The only accesses that are not tested against FB_OFF are host (slave) accesses, DAC Feed reads 
and Primary Core instruction reads which always go to the framebuffer. 


3D Address Base 


Implicit Address 0x0286 
Reset Value 0x0000 
15-12 


ISP_PARAM_BASE ISP_PTR_BASE ISP_REGION_BASE TSP_PARAM_BASE 


ISP_PARAM_BASE (RW) 
ISP_PTR_BASE (RW) 

ISP_REGION_BASE (RW) 
TSP_PARAM_BASE (RW) 


The 3D ISP Parameter, 3D Pointer, 3D Region and Texture Parameter ports onto the arbiter only 
supply a 28 bit byte address. Each of these fields represents the top 4 bits of a 32-bit byte address for 
the respective ports to allow a full 4GB address range. This address translation step occurs before the 
GART remapping described below. 


GART TA Remap Lo 


Implicit Address 0x0289 
Reset Value OxOOFF 
15-8 
GART_TA_OFFSET 
GART TA Remap Hi 
Implicit Address 0x028A 
Reset Value 0x0000 
Reserved 
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Implicit Address 0x028B 
Reset Value OxOOFF 


15-8 
GART_ISP_OFFSET GART_ISP_CMP 


GART ISP Param Remap Hi 


Implicit Address 0x028C 
Reset Value 0x0000 


Reserved GART_ISP_REMAP 


GART TSP Param Remap Lo 


Implicit Address 0x028D 
Reset Value OxOOFF 


16-8 
GART_TSP_OFFSET GART_TSP_CMP 


GART TSP Param Rernap Hi 


Implicit Address 0x028E 
Reset Value 0x0000 


15-6 
Reserved GART_TSP_REMAP 


GART_TA_CMP, GART_TA_REMAP, GART_TA_OFFSET (RW) 
GART_ISP_CMP, GART_ISP_REMAP, GART_ISP_OFFSET (RW) 
GART_TSP_CMP, GART_TSP_REMAP, GART_TSP_OFFSET (RW) 


32 bit byte write addresses from the Tile Accelerator are remapped as follows: 


if (Addr[25:18] > GART TA CMP) then 
{ 


Addr [31:26] 
Addr [25:18] 


GART_TA_REMAP 
Addr [25:18] + GART_TA OFFSET 


} 


This means that tile accelerator writes beyond a given 256KB boundary in the framebuffer 
(GART_TA_CMP) are remapped to a 64MB region of host memory where the GART table is mapped 
(GART_TA_REMAP) and offset to any 256KB boundary within that region (GART_TA_OFFSET). The 
addition performed with GART_TA_OFFSET is an unsigned addition and overflow bits are discarded 
such that the final address can wrap within the 64MB region. 


The same remapping is performed for 3D ISP parameter reads and 3D Texture Parameter reads, 
controlled by the GART ISP and GART TSP remap registers respectively. 
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Cache Control 
Writeburster Timeout 


Implicit Address 0x0288 
Reset Value 0x0000 


15-8 
Reserved WB_TO 


WB_TO (RW) 
The time in clock ticks before the 2D write burster flushes it’s contents out to memory. 


Writeburster Page Break Cost 


Implicit Address 0x0289 
Reset Value 0x0000 


Reserved WB_PAGEBRK_COST 


WB_PAGEBRK_COST (RW) 


Cost in clock cycles for a page break. This is used by the write burster to optimise memory accesses. 
Writeburster Page Hit Cost 


Implicit Address 0x028A 
Reset Value 0x0000 


15-8 
Reserved WB_PAGEHIT_COST 


WB_PAGEHIT_COST (RW) 
Cost in clock cycles for page hit. This is used by the write burster to optimise memory accesses. 


Writeburster Burst Cost 


Implicit Address 0x028B 
Reset Value 0x0000 


6-8 
Reserved WB_BURST_COST 


WB_BURST_COST (RW) 
Max time a burst can take on the bus. This is used by the write burster to optimise memory accesses. 


Writeburster Burst Length 


Implicit Address 0x028C 
Reset Value 0x0000 


_ 
Reserved WB_BURST_LEN 


WB_BURST_LEN (RW) 


Maximum burst length for a write. This is used by the write burster to optimise memory accesses. 
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HOsT INTERFACE 


The PowerVR 250 host interface supports AGP v1.0 and PCI v2.1 bus standards. The AGP interface 
is AGP 2x with side band addressing for peak bus bandwidths of 532MB/s. PowerVR 250 also 
supports a 32 deep request queue. The PCI interface is a PCI-33 compliant interface with write 
combining support. In Master transactions, the AGP interface takes read or write requests from the 
arbiter. These may be either AGP or PCI requests. AGP read requests are queued in a request queue. 


PCI Cycle Base Address 
Implicit Address 0x0292 
Reset Value OxFFFF 
15-0 
PCI_LBASE_ADDR 


PCI_BASE_ADDR (RW) 


If the top 16 bits of an address issued to the host interface match PCI_BASE_ADDR, then the request 
is issued as a PCI request rather than an AGP request. This is used to ensure cache coherency with 
the host in a single 64KB region of host memory. 


AGP Flush 
Implicit Address 0x028F 
Reset Value 0x0000 


Reserved AGP_FLUSH 


AGP_FLUSH (RW) 


When set, this bit enables the flush command for AGP master cycles. A flush will occur on the bus 
when this bit is set and a read access occurs to the top 64-bit word address in the PCI window. 


A flush command ensures that low priority AGP accesses have become visible to the host system. It 
therefore guarantees that there are no transactions pending in the AGP pipeline and that writes to 
system memory have completed. 


AGP Fence 
Implicit Address 0x0290 
Reset Value 0x0000 


Reserved AGP_FENCE 


AGP_FENCE (RW) 


When set, this bit enables a fence command for AGP master cycles. A fence will occur on the bus 
when this bit is set and a write access occurs to the penultimate 64-bit word address in the PCI 
window. 


A fence command ensures that all low priority AGP accesses issued before the fence are completed 
before any low priority accesses following the fence. 
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SDRAM INTERFACE 


The memory interface consists of two primary sections, a Synchronous DRAM (SDRAM) interface and 
an arbiter to prioritise commands issued by the various requestors. 


The SDRAM interface supports the following types of SDRAM and SGRAM devices, with a maximum 
of 4 physical banks, except when used with 64Mb SDRAM devices when 8 physical banks are 
supported. 


SDRAM 

16 Mbit - 8 bit wide, up to 32 MBytes 

16 Mbit - 16 bit wide, up to 16 MBytes 

2 64 Mbit - 32 bit wide, up to 32 MBytes .y f° 

ESDRAM 

16 Mbit - 16 bit wide, up to 16 MBytes 
SGRAM 

16 Mbit - 32 bit wide, up to 8 MBytes 


If SGRAM devices are used, the DSF pin must be tied low in order to render its operation identical to 
that of an SDRAM. The interface has programmable timing registers that allow for performance to be 
maximised according to the clock frequency used, and for the timing constraints of different devices. 


Standard SDRAM and SGRAM devices contain two logical banks of data, enabling two pages to be 
held active in any one physical bank at a time. The SDRAM interface supports up to 4 physical banks 
of data, and can therefore hold up to a maximum of 8 pages open at any one time. The use of this 
feature of SDRAMs and SGRAMs can reduce the number of page breaks required. 


When large amounts of memory are used, it may become necessary (for bus loading purposes) to 
register data lines using bi-directional transceivers. A direction pin SDDIR has been provided for this 
purpose. 


The registers need to be programmed to meet the constraints of the SDRAM devices used. Default 
values support operation at 100 MHz, using NEC 16 MBit SDRAMs (DASH 10, 16 bit wide). These 
values may be altered whilst the SDRAM interface is held in reset using the reset register. When the 
interface is brought out of reset the SDRAMs are automatically initialised and the mode register 
programmed. 


SDRAM / Arbiter Config 0 


Implicit Address 0x0284 
Reset Value 0x001F 
5-8 
DAG_LAT_COUNT 


LAT_COUNT (RW) 


The arbitration mechanism is optimised for reducing page breaks in memory and requests to open 
pages normally take priority over other requests. LAT_COUNT is the number of clock cycles a request 
can be kept waiting by the arbiter granting access to in page requests before the arbiter re-arbitrates 
ignoring whether requesters are in page or not. 


DAC_LAT_COUNT (RW) 


DAC_LAT_COUNT is the number of clock cycles DAC Feed reads can be kept waiting by the arbiter 
granting access to in page requests before the arbiter re-arbitrates ignoring whether requesters are in 
page or not. 
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SDRAM / Arbiter Config 1 


Implicit Address 0x0285 | 
Reset Value 0x0003 


15-10 ee 
Reserved TILE_MODE MEM_TYPE ARB_OVERRIDE ARB_CTL 


ARB_CTL (RW) 


The arbiter decides on which requester to grant access to normally by priority only, but the arbiter can 
be set to operate in mocies which operate differently as shown below: 


00b Priority only 

01b Priority with override 

10b Priority with counter override 
11b Priority with last win override 


“Priority with override” makes a port selected in the ARB_OVERRIDE field the highest priority port. 
“Priority with counter override” means that any port which is waiting longer than the value programmed 
in the LAT_COUNT field will become highest priority and “Priority with last win override” gives highest 
priority to the requestor which was selected in the previous clock cycle. 


ARB_OVERRIDE (RW) 


Selects which port has highest priority when the arbiter is in “Priority with override” mode. 


0000b Host / 3D render writes 

0001b Host / Texture reads 

0010b Tile Accelerator Object Data reads 

0011b Tile Accelerator / Secondary Core writes 
0100b Tile Accelerator Control Data / Secondary Core reads 
0101b 3D ISP Parameter / 3D Pointer reads 

0110b 3D Region / 3D Texture Parameter reads 
0111b Primary Core writes 

1000b Primary Core Instruction reads 

1001b Primary Core DAC thread reads 

1010b Primary Core Video / Background thread reads 
1011b DAC Feed reads 

1100b reserved 

1101b reserved 

1110b reserved 

1111b reserved 


MEM_TYPE (RW) 


The memory type is the type of SDRAM/SGRAM device used and affects the address mapping to 
provide contiguous address space. 


00b  16Mbx16 SDRAM / 64Mbx32 SDRAM (16MB only) 
01b  16Mbx8 SDRAM 

10b  16Mbx32 SGRAM 

11b  64Mbx32 SDRAM (32MB only) 


TILE_MODE (RW) 


This controls the tiling of the SDRAM memory. 


00b No Tiling 
O1b 64x32 byte tiles 
10b 128x16 byte tiles 
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11b 256x8 byte tiles 


Graphics framebuffers are used as two dimensional arrays of pixels but typically the memory in which 
they are stored is arranged a one-dimensional (linear) array of memory locations organised into 
separate pages. SDRAM accesses to locations within the same page usually occur within a single 
cycle, but memory accesses to a different page incur a 6-10 clock cycle penalty. The page-size of 
16Mbit and 64Mbit SDRAM on a 64-bit system is 2048 bytes which means that in modes such as 
1024x768 16bpp, every screen scanline is in a different memory page. This makes drawing tall, 
narrow rectangles which cross many page-boundaries much slower than drawing short, wide ones. 


Memory tiling PowerVR 250 allows the cost of some 2D operations to be reduced by rearranging the 
pixels in memory so that the SDRAM pages map to small rectangular tiles in the framebuffer instead of 
lines. For example in the 128x16 byte tile mode, each SDRAM page represents 128 horizontal BYTE 
locations in 16 different scanlines. This means that small (e.g. 8x8 pixel, 16bpp) rectangles which don't 
cross any tile boundaries can be drawn without any page-breaks at all. In addition the cost of breaking 
page when drawing across two horizontally adjacent tiles is reduced by mapping horizontally adjacent 
tiles to different banks within the SDRAM array. 


- PowerVR 250 memory tiling assumes that the screen stride is 2048 bytes and treats bits 0-11 of the 


linear-framebuffer address as an x-position and the other most significant bits as a y-position. It then 
rearranges these bits into SDRAM page and bank numbers and offsets within a page such that all 
three fields contain a mix of x and y bits. While the assumption of a 2048 byte line stride may not be 
optimal for all screen formats, it does provide much better performance than untiled memory for a 
great many screen formats and BLT sizes. 


SDRAM Refresh 


Implicit Address 0x0283 
Reset Value 0x0020 


15-8 
Reserved SD_REFRESH 


SD_REFRESH (RW) 


The refresh value is the number of clock cycles between refreshes divided by 48. 
If 1 refresh is required every 15.625 us then, at 100 MHz: 
Clock period : 7 =10ns 


Clock cycles between refreshes : y = he = 1562.5 


fet 
48 
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Implicit Address 0x0280 
Reset Value 0x8981 


9-6 
ACT_TO_PRE 


Implicit Address 0x0281 
Reset Value ; -0x1040 


14-13 | 12 
SD_TRI_|SD_MODE| SD_DBL | CAS_LAT ACT_TO_ACT | RD_TO_WR_ | REF_TO_ACT HI 


SDRAM Config 2 


RD_TO_PRE 


15-14 13-10 


REF_TO_ACT PRE_TO_ACT 


SDRAM Config 1 


5-4 
ACT_TO_RW 


WR_TO_PRE 


Implicit Address 0x0282 
Reset Value 0x3101 


15-8 a ee ee ee ee 
Resenied 
SD_TRI (RW) 
Setting this bit tri-states all SDRAM interface signals, allowing another agent to use the SDRAM bus. 


SD_MODE (RW) 


Sets the mode of SDRAM operation in conjunction with MEM_TYPE 


O0Ob 16Mb SDRAM Mode & 64Mb SDRAM (32MB only) 
O01b 64Mb SDRAM Mode (16MB only) 
10b reserved 
11b reserved 
SD_DBL (RW) 


For medium loading upon the data bus it may be necessary to externally buffer the address, CS, RAS, 
CAS and WE lines. For this reason internal registers have been built into the write data path and which 
align the data with these registers. 


0 = Double buffer registering disable 
1 = Double buffer registering enable 


CAS_LAT (RW) 


Sets the CAS latency in the SDRAM mode register. 


000b _—sreserved 

001b CAS latency of 1 
010b CAS latency of 2 
011b CAS latency of 3 


ARB_PGBRK (RW) 


The interval, in clock cycles, that the arbiter waits after a page break starts before it’s next access. 
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SD_PWR (RW) 


Setting this bit puts the SDRAM interface into self refresh low power mode. 


SD_INIT (RW) 


Setting this bit ensures that the SDRAM mode is initialised after an initial refresh cycle, which is 
required by some memory devices. 


Command Intervals 


The command intervals control the speed of operation of the SDRAM Interface. The values for these 
bits are directly related the synchronous timings for the SDRAM devices used. 


The read command to return data delay is the number of clock cycles minus 1 that it takes for read 
return data to be registered into PowerVR 250 after a read command has been issued. This is mainly 
affected by CAS latency, which must have a value of 3 for operation at 100 MHz. The read to write 
command interval is equal to the CAS latency plus 2, allowing for a minimum of 1 idle clock cycle on 
the SDRAM data bus between reads and writes. 


The refresh to activate and activate to read/write periods are the minimum number of clock cycles 
between these commands minus 2. All other command periods are equal to the number of clock 
cycles minus 1. The timing values for these registers can be calculated from the clock frequency and 
the SDRAM device data sheet. Below is a table indicating correct values for the NEC SDRAM 
uPD4516161-A10 and uPD4516161-10 parts when operating at 100 MHz. The default value is the 
uPD5616161-10 values. 


Note: The minimum Activate to read/write command interval is 0x1. 


NEC uPD4516161A-10 or NEC uPD4516161-10 Samsung KM416S1020CT-L or 
NEC uPD4516161A-10B a uPD4516161-A10 Rev. 


Read command to returned data 
delay 


CAS Latency value mode register 
in SDRAM 


Activate to Activate Period 
Read To Write Period 

Refresh To Activate Period 
Pre-charge To Activate Period 
Activate To Pre-charge Period 


Activate To Read/Write 
Command Interval 


Write to Pre-charge Period 


Read to Pre-charge Period 


Table 4. SDRAM Timings for 100MHz 


Version Draft A 33 11 December 1998 


VideoLogic ~~ fe 


The following figure shows the timings for the SDRAM for a CAS latency of 3 without transceivers. 


Ons | 25ns 50ns 
| | | | | | | 


Tas 
CAS latency = : 
Ta 


Comman 


Address/Dat 


Symbol 


-j ( 
5 ( 
5 ( 


SDRAM address/data-in hold time 
PowerVR 250 access time from SDCLK 5.5 (max — TBD) 
PowerVR 250 address/data-in setup time 


PowerVR 250 address/data-in hold time 


Tpmx_setup 
Tpmx_hold 
Table 5. SDRAM Read Timing 


SDRAM Power Status 


Implicit Address 0x0293 
Reset Value 0x0000 


5-1 a ae 
Reserved SD_PWR_STAT 


SD_PWR_STAT (RO) 


This bit indicates whether the SDRAM devices have been powered down into self refresh mode by the 
SDRAM interface. A value of 1 indicates normal operation and a value of 0 indicates self refresh 
mode. 
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VGA 


PowerVR 250 incorporates a fully compliant VGA emulator. 
VGA Page Select 


Offset 0x00A0 Implicit Address 0x0168 
Reset Value 0x0000 


14 oto | 
VGAPAGE_EN VGA_PAGE 


This register allows mapping of different areas of the framebuffer into the VGA memory space 
(OxA0000, 0xBO000 or 0xB8000). Each page is 64KB long and up to 1024 may be mapped (depending 
on physical memory). 


VGA_PAGE (RW) 


Select which page to map. 


VGAPAGE_EN (RW) 
Enable VGA paging. Paging works in both true VGA and VGA emulation mode. 
Emulation FB Base 


Implicit Address 0x01F4 
Reset Value 0x0000 


15-11 11 21 | oo 
EM_F®_BASE EM_CLR 


EM_CLR (RW) 

Clears emulation status register when set to 1 and then back to 0. 

EM_FB_BASE (RW) 

The VGA Emulator supports the relocation of memory accesses from the host into a non zero 


framebuffer memory location. 


When accessing framebuffer memory from the host in VGA emulation mode (i.e. through the normal 
VGA memory addresses of 0xA0000, 0xB0000 or 0xB8000), the framebuffer is organised as four 
128KB interleaved planes of memory, occupying 512KB of address space. 


EM_FB_BASE sets the base address for host accesses when in VGA emulation mode and is the top 
9 bits of the 25 bit QWORD address to SDRAM, allowing the base address for emulation accesses to 
be aligned on any 512KB boundary. 


Emulation Status 


Implicit Address 0x01F5 
Reset Value 0x0000 


Reservec EMS_MISC | EMS_ATTR | EMS_GRA EMS_SEQ | EMS_CRTC 


When using VGA Emulation, accesses to VGA IO registers are redirected and used to emulate VGA 
operation. The Emulation status register indicates which registers have been accessed. The register is 
reset by any read to itself or any of the other VGA emulation registers. Each status bit tracks access to 
the following emulation registers: 
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Status for writes to all the VGA CRT controller registers (0x3B5 / 0x3D5) 


EMS_SEQ (RO) 


Status for writes to all the VGA sequencer registers (0x3C5) 


EMS_GRA (RO) 


Status for writes to all the VGA graphics registers (Ox3CF) 


EMS_ATTR (RO) 


Status for writes to all the VGA attribute registers (0Ox3C0) 


EMS_MISC (RO) 


Status for writes to the VGA miscellaneous register (0x3C2) 


Emulation Palette F/E 


Implicit Address 0x01F6 
Reset Value 0x0000 
15-14 13-8 


EM_PAL_F 


Emulation Palette D/C 


Implicit Address 0x01F7 
Reset Value 0x0000 


19-8 


EM_PAL_D 


Emulation Palette B/A 


Implicit Address Ox01F8 
Reset Value 0x0000 


19-8 


EM_PAL_B 


Emulation Palette 9/8 


Implicit Address 0x01F9 
Reset Value 0x0000 
15-14 13-8 


EM_PAL. 9 


Emulation Palette 7/6 


Implicit Address Ox01FA 
Reset Value 0x0000 


15-14 13-8 
EM_PAL? 
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Reserved EM_PAL_E 


EM_PAL_C 


EM_PAL_A 


EM_PAL.6 


EM_PAL.6 
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Emulation Palette 5/4 


Implicit Address 0x01FB 
Reset Value 0x0000 


15-14 
EM_PAL. 5 EM_PAL_4 


Emulation Palette 3/2 


Implicit Address 0x01FC 
Reset Value 0x0000 


Emulation Palette 1/0 


Implicit Address 0x01FD 
Reset Value 0x0000 


18-14 
EM_PAL_1 EM_PAL_0 
EM_PAL_x (RO) 


All the EGA Palette emulation registers are updated with EGA palette writes from the host when 
PowerVR 250 is in VGA emulation mode. 


Emulation Register 9 


Implicit Address 0x01FE 
Reset Value 0x0000 


19-8 
EM_ATTR_CPE EM_ATTR_OSCAN 


EM_ATTR_OSCAN (RO) 
Emulation register for VGA Attribute Controller Overscan Color register. 
EM_ATTR_CPE (RO) 


Emulation register for VGA Attribute Controller Color Plane Enable register 


Emulation Register 0x0A 


Implicit Address Ox01FF 
Reset Value 0x0000 


15-12 
EM_ATTR_COLSEL EM_ATTR_HZPAN EM_ATTR_CTL 


EM_ATTR_CTL (RO) 
Emulation register for VGA Attribute Controller Mode Control 
EM_ATTR _HZPAN (RO) 


Emulation register for VGA Attribute Controller Horizontal PEL Panning 
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Emulation register for VGA Attribute Controller Color Select 


Emulation Register 0x0B 


Implicit Address 0x0200 
Reset Value 0x0000 


EM_GRA_MASK EM_GRA_CDC EM_GRA_MISC 


EM_GRA_MISC (RO) 


Emulation register for VGA Graphics Controller Miscellaneous Register bits 3 - 0 
EM_GRA_CDC (RO) 

Emulation register for VGA Graphics Controller Color Don’t Care Register bits 3 — 0 
EM_GRA_MASK (RO) 


Emulation register for VGA Graphics Controller Bit Mask Register 
Emulation Register 0x0C 


Implicit Address 0x0201 
Reset Value 0x0000 


14-13 
EM_GRA_READSEL EM_GRA_ROTATE EM_GRA_MODE 


EM_GRA_MODE (RO) 


Emulation register for VGA Graphics Controller Mode Register bits 6 - 0 


EM_GRA_ROTATE (RO) 
Emulation register for VGA Graphics Controller Data Rotate Register bits 4 - 0 


EM_GRA_READSEL (RO) 
Emulation register for VGA Graphics Controller Read Map Select Register bits 1 - 0 
Emulation Register 0x0D 


Implicit Address 0x0202 
Reset Value 0x0000 


15-12 
Reserved EM_GRA_CC EM_GRA_ESR EM_GRA_SRR 


EM_GRA_SRR (RO) 


i 
w 
' 
oO 


Emulation register for VGA Graphics Controller Set/Reset Register bits 3 - 0 


EM_GRA_ESR (RO) 


Emulation register for VGA Graphics Controller Enable Set/Reset Register bits 3 - 0 
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EM_GRA_CC (RO) 

Emulation register for VGA Graphics Controller Color Compare Register bits 3 - 0 


Emulation Register 0x0E 


Implicit Address 0x0203 
Reset Value 0x0000 


EM_CRT_CURST (RO) 
Emulation register for VGA CRT Controller Cursor Start Register bits 4 — 0 


EM_CRT_CUREND (RO) 
Emulation register for VGA CRT Controller Cursor End Register bits 4 — 0 


Emulation Register 0x0F 


Implicit Address 0x0204 
Reset Value 0x0000 


EM_CRT_CURHI EM_CRT_CURLO 


EM_CRT_CURHI (RO) 


Emulation register for VGA CRT Controller Cursor Location High Register 


EM_CRT_CURLO (RO) 
Emulation register for VGA CRT Controller Cursor Location Low Register 


Emulation Register 0x10 


Implicit Address 0x0205 
Reset Value 0x0000 


7-0 
EM_CRT_MODE (RO) 


Emulation register for VGA CRT Controller Mode Control Register 


EM_CRT_ULINE (RO) 
Emulation register for VGA CRT Controller Underline Location Register bits 6 - 0 
Emulation Register 0x11 


Implicit Address 0x0206 
Reset Value 0x0000 


EM_CRT_ROWSCAN EM_CRT_DBLSCAN 
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EM_CRT_HSYNCDELAY EM_CRT_HZSKEW EM_CRT_CURSKEW EM_CRT_CUROFF 


EM_CRT_CUROFF (RO) 


Emulation register for VGA CRT Controller Cursor Start Register - Cursor Off bit. 


EM_CRT_CURSKEW (RO) 


Emulation register for VGA CRT Controller Cursor End Register Cursor - Skew Control bits 


EM_CRT_HZSKEW (RO) 


Emulation register for VGA CRT Controller End Horizontal Blanking Register — Display Enable Skew 
Control bits 


EM_CRT_HSYNCDELAY (RO) 


Emulation register for VGA CRT Controller End Horizontal Retrace Register — Horizontal Retrace 
Delay bits 


EM_CRT_DBLSCAN (RO) 


Emulation register for VGA CRT Controller Maximum Scan Line Register — 200 -> 400 Line 
Conversion bit 


EM_CRT_ROWSCAN (RO) 


Emulation register for VGA CRT Controller Preset Row San Register bits 6 — 0 (Byte panning and 
Starting Row Scan). 


Emulation Register 0x12 


Implicit Address 0x0207 
Reset Value 0x0000 


15-10 ee ee ee 7-0 


EM_CRT_LCOMP (RO) 


Emulation register for VGA CRT Controller Line Compare Register 
EM_CRT_LCOMP8 (RO) 
Emulation register for VGA CRT Controller Overflow Register — Line Compare Bit 8 


EM_CRT_LCOMP9 (RO) 


Emulation register for VGA CRT Controller Maximum Scan Line Register — Line Compare Bit 9 


Emulation Register 0x13 


Implicit Address 0x0208 
Reset Value 0x0000 


15-13 
EM_CRT_MAXSCAN EM_CRT_LINEWIDTH 
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EM_CRT_LINEWIDTH (RO) 
Emulation register for VGA CRT Offset Register — Logical Line Width of the Screen 


EM_CRT_MAXSCAN (RO) 
Emulation register for VGA CRT Controller Maximum Scan Line Register — bits 4 - 0 


Emulation Register 0x14 


Implicit Address 0x0209 
Reset Value 0x0000 


EM_CRT_SAL (RO) 


Emulation register for VGA CRT Controller Start Address Low Register 


EM_CRT_SAH (RO) 
Emulation register for VGA CRT Controller Start Address High Register 


Emulation Register 0x15 


Implicit Address 0x020A 
Reset Value 0x0000 


EM_CRT_VSE (RO) 
Emulation register for VGA CRT Vertical Retrace End Register bits 3 — 0. 


Emulation Register 0x16 


Implicit Address 0x020B 
Reset Value 0x0000 


a 
EM_CRT_VSS9 EM_CRT_VSS8 EM_CRT_VSS 


EM_CRT_VSS (RO) 


Emulation register for VGA CRT Controller Vertical Retrace Start Register 
EM_CRT_VSS8 (RO) 

Emulation register for VGA CRT Controller Overflow Register — Vertical Retrace Start Bit 8 
EM_CRT_VSS9 (RO) 

Emulation register for VGA CRT Controller Overflow Register— Vertical Retrace Start Bit 9 
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Emulation Register 0x17 


Implicit Address 0x020C 
Reset Value 0x0000 


7-0 
EM_CRT_VBE 


EM_CRT_VBE (RO) 


Emulation register for VGA CRT Controller Vertical Blank End Register 


Emulation Register 0x18 


Implicit Address 0x020D 
Reset Value 0x0000 


Posto] i 
EM_CRT_VBS (RO) 
Emulation register for VGA CRT Controller Vertical Blank Start Register 
EM_CRT_VBS8 (RO) 
Emulation register for VGA CRT Controller Overflow Register — Vertical Blank Start Bit 8 
EM_CRT_VBS39 (RO) 
Emulation register for VGA CRT Controller Maximum Scan Line Register— Vertical Blank Start Bit 9 


Emulation Register 0x19 


Implicit Address 0x020E 
Reset Value 0x0000 


oof te Pt 
EM_CRT_VDE9 EM_CRT_VDE8 EM_CRT_VDE 


EM_CRT_VDE (RO) 

Emulation register for VGA CRT Controller Vertical Display End Register 
EM_CRT_VDE8 (RO) 

Emulation register for VGA CRT Controller Overflow Register — Vertical Display End Bit 8 
EM_CRT_VDE9 (RO) 

Emulation register for VGA CRT Controller Overflow Register— Vertical Display End Bit 9 


Emulation Register 0x1A 


Implicit Address 0x020F 
Reset Value 0x0000 


wero | | Cid 
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EM_CRT_VT (RO) 

Emulation register for VGA CRT Controller Vertical Total Register 

EM_CRT_VT8 (RO) 

Emulation register for VGA CRT Controller Overflow Register — Vertical Total Bit 8 

EM_CRT_VT9 (RO) 

Emulation register for VGA CRT Controller Overflow Register— Vertical Total Bit 9 


Emulation Register 0x1B 


Implicit Address 0x0210 
Reset Value 0x0000 


er 
EM_CRT_HSE (RO) 
Emulation register for VGA CRT Controller Horizontal Sync End Register bits 4 - 0 


Emulation Register 0x1C 


Implicit Address 0x0211 
Reset Value 0x0000 


EM_CRT_HSS (RO) 
Emulation register for VGA CRT Controller Horizontal Sync Start Register 


Emulation Register 0x1D 


Implicit Address 0x0212 
Reset Value 0x0000 


a 
EM_CRT_HBE (RO) 
Emulation register for VGA CRT Controller Horizontal Blank End Register bits 4 — 0. 


EM_CRT_HBES5 (RO) 


Emulation register for VGA CRT Controller Horizontal End Horizontal Retrace Register — Horizontal 
Blank End bit 5. 


Emulation Register 0x1E 


Implicit Address 0x0213 
Reset Value 0x0000 


EM_CRT_HBS 
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EM_CRT_HBS (RO) 
Emulation register for VGA CRT Controller Horizontal Blank Start Register 


Emulation Register 0x1F 


Implicit Address 0x0214 
Reset Value 0x0000 


EM_CRT_HDE (RO) 
Emulation register for VGA CRT Controller Horizontal Display End Register 
Emulation Register 0x20 


Implicit Address 0x0215 
Reset Value 0x0000 


EM_CRT_HT (RO) 
Emulation register for VGA CRT Controller Horizontal Total Register 
Emulation Register 0x21 


Implicit Address 0x0216 
Reset Value 0x0000 


a 
EM_SEQ_MEMMODE 


EM_SEQ_MEMMODE (RO) 


Emulation register for VGA Sequencer Memory Mode Register. (Note top 4 bits and bottom bit are 
reserved). 


Emulation Register 0x22 


Implicit Address 0x0217 
Reset Value 0x0000 


EM_SEQ_MAPSEL EM_SEQ_MAPMASK 


EM_SEQ MAPMASK (RO) 
Emulation register for VGA Sequencer Map Mask Register. (Note top 4 bits are reserved). 
EM_SEQ_ MAPSEL (RO) 


Emulation register for VGA Sequencer Character Map Select Register. (Note top 4 bits are reserved). 
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Emulation Register 0x23 


Implicit Address 0x0218 
Reset Value 0x0000 
15-8 7-0 


EM_SEQ_RESET (RO) 

Emulation register for VGA Sequencer Reset Register. (Note top 6 bits are reserved). 
EM_SEQ_CLOCKMODE (RO) 

Emulation register for VGA Sequencer Clocking Mode Register. (Note top 2 bits are reserved). 


Emulation Register 0x24 


Implicit Address 0x0219 
Reset Value 0x0000 


7-0 
EM_Misc 


EM_MISC (RO) 


Emulation register for VGA Miscellaneous Output Register. 
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ROM & GPIO INTERFACE 


The ROM and GPIO interface is an 8-bit data port with a 128k address range shared between ROM, 
VMI host and general purpose IO. 


The sharing of the memory space is controlled by a pin ROMIO, which when set high means that the 
access is intended for VMI / GPIO space and when set low means the access is for ROM space (it is 
typically wired to ROMCS#). 


Accesses to the interface through the Expansion ROM Base Address memory space always result in 
an access on the bus with ROMIO set low so that the ROM is accessed. 


Accesses to the interface can also be made by the Primary Core fetching instructions which is 
performed through a separate arbiter and also always accesses the ROM space. 


ROM Offset 


Implicit Address 0x0288 
Reset Value 0x0100 


ROM_OFF 


ROM_OFF (RW) 


The 14-bits specified for the ROM offset are the top 14-bits of a 32-bit byte address. The top 14 bits of 
instruction fetch addresses issued from the Primary Core are compared against this value, and if they 
match, the request is issued to the ROM interface and the top 15 bits of the address removed. This 
allows 128K ROM to be located on any 256K boundary in the 4GB address space. These bits are set 
to 0x0100 on hard reset which means that ROM addresses start at the 64MB address boundary - just 
above the frame buffer (which is mapped to the region 0-64M on reset). Note that this also means that 
the Primary Core cannot access anything in the 128K region above the top of ROM / lO space. 


GPIO Accesses 


ROM Control 
Implicit Address 0x00B8 
Reset Value 0x0000 


Pte te fl 
ose || Romio _|_ROM_ADDRtE 


ROM_ADDR‘16 (RW) 


Top address bit used on the ROM / IO interface when accessing the interface through Implicit 
registers. 


ROM_IO (RW) 


Selects between ROM and GPIO 128k address spaces when accessing the interface through Implicit 
registers. 


0 Access ROM space (low 128k) 
1 Access GPIO space (high 128k) 


Accessing GPIO space results in the ROMIO pin being asserted high and the VMICS# pin asserted 
low. 
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These 4 bits act as a Byte Enable mask for writes of up to 32 bits through the GPIO port using Implicit 
registers. Which bytes of the ROM Data registers are written depends on the mask (reads are always 
4 8 bit reads). The least significant byte is written to the address in the ROM Address register if the 

associated mask bit is set, the next byte is written to (byte address + 1) if the next bit is set, and so on. 


ROM Read Address 


Implicit Address 0x00B9 
Reset Value 0x0000 


ROM_READ_ADDR_LO 


ROM_READ_ADDR_LO (RW) 


Writing to this register causes four bytes to be read from the ROM IO port at the specified address and 
up to three successive byte addresses. The data read is returned in ROM_DATA_LOW and 
ROM_DATA_HIGH. 


Note: ROM_READ_ADDR_LO is always DWORD aligned and reads occur from the highest byte 
address to the lowest byte address (e.g. address+3, address+2, address+1 and address). 


ROM Write Address 


Implicit Address Ox00BA 
Reset Value 0x0000 


ROM_WRITE_ADDR_LO 


ROM_WRITE_ADDR_LO (RW) 


Writing to this register causes between one and four bytes to be written to the ROM IO port at the 
specified address and up to three successive addresses. The data written is that stored in 
ROM_DATA_LOW and ROM_DATA_HIGH. 


ROM Data Low 


Implicit Address 0x00BB 
Reset Value 0x0000 


ROM_DATA_LO 


ROM_DATA_LO (RW) 


Low 16 bits of data returned from or written to the ROM IO port. 
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Implicit Address Ox00BC 
Reset Value 0x0000 


ROM_DATA_HI 


ROM_DATA_HI (RW) 
High 16 bits of data returned from or written to the ROM IO port. 
VMI Host Accesses 


VMI Host 0 
Offset 0x00D0 
Reset Value 0x0000 


31-24 23-16 
VMI_DATA_3 VMI_DATA_2 VMI_DATA_1 VMI_DATA_0 


VMI Host 1 
Offset 0x00D4 
Reset Value 0x0000 


23-16 
VMI_DATA_7 VMI_DATA_6 VMI_DATA_5 VMI_DATA_4 


VMI Host 2 
Offset 0x00D8 
Reset Value 0x0000 


23-16 
VMI_DATA_B VMI_DATA_A VMI_DATA_9 VMI_DATA_8 


VMI Host 3 
Offset 0x00DC 
Reset Value 0x0000 


31 = 2A 23-16 
VMI_DATA_F VMI_DATA_E VMI_DATA_D VMI_DATA_C 


Allows direct access to the bottom 16 byte locations of GPIO space directly from the host. When 
accessed using this method, the VMICS# pin is driven low and the ROMIO pin high. 


ll1C Registers 


lIC Interface A 
Offset 0x0050 Implicit Address 0x0118 
Reset Value 0x0000 


18-4 a a ee 
Reserved ACLK_IN ADAT_IN | ACLK_OUT | ADAT_OUT 


This register is used for driving one of the PowerVR 250 IIC busses. This bus is used for video in 
control of video decoders or MPEG decoders. IIC is an active low bus with pull ups, hence setting the 
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data or clock to 0 drives the bus, whereas setting them to 1 releases the bus, although another agent 
may still drive it. 


IC Interface B 
Offset 0x0054 Implicit Address 0x0119 
Reset Value 0x0000 


15-4 3 2 1 | oo 


This register is used to drive the second PowerVR 250 IIC bus. This bus is used for monitor detection 
using DDC. It acts in the same way as bus A described above. 


ADAT_OUT, BDAT_OUT(RW) 

0 drives data line low. 1 releases the data line. 
ACLK_OUT, BCLK_OUT (RW) 

0 drives clock line low. 1 releases the clock line. 
ADAT_IN, BDAT_IN (RO) 

Reads current value of IIC data line. 

ACLK_IN, BCLK_IN (RO) 


Reads current value of IIC clock line. 
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PRIMARY CORE 


The primary core is responsible for processing requests from the host and initiating 2D operations as 
well as handling other data input and output from the DAC, TV and Video In feeds. It is a microcoded 
engine which executes up to four separate threads of microcode. The background thread (thread 0) is 
responsible for host communication and control of the 2D core and three real time threads which are 
each responsible for one of the data feeds: The DAC Feed uses thread 3, the TV Encoder thread 2 
and Video In thread 1. 


Thread Enable 0 


Offset 0x0000 Implicit Address 0x0000 
Reset Value 0x0001 


Thread Enable 1 


Offset 0x0004 Implicit Address 0x0001 
Reset Value 0x0000 
15=1 


Reserved EN_T1 


Thread Enable 2 \ 


Offset 0x0008 Implicit Address 0x0002 
Reset Value 0x0000 
ENT2 
Thread Enable 3 
Offset 0x000C Implicit Address 0x0003 
Reset Value 0x0000 


15-1 


Reserved EN_T3 


EN_TO, EN_T1, EN_T2, EN_T3 (RW) 


All these registers operate in the same manner and enable or disable their respective threads. They 
do not destroy any state of the thread. 


Program Counters for T1, T2 and T3 can only be set by the Primary Core, so TO normally initialises 
the other threads. Thread 0 starts at it’s current program counter, which is initialised at reset (hard or 
soft) to the value held in the Reset Vector register. 


Thread Recover 0 


Offset 0x0020 Implicit Address 0x0008 
Reset Value 0x0000 


Pr 
REO_To 
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Thread Recover 1 


Offset 0x0024 Implicit Address 0x0009 
Reset Value 0x0000 
15 | oo 
Thread Recover 2 
Offset 0x0028 Implicit Address Ox000A 
Reset Value 0x0000 
ee es er ee 
Thread Recover 3 
Offset 0x002C Implicit Address 0x000B 
Reset Value 0x0000 
a 


REC_T0, REC_T1, REC_T2, REC_T3 (RW) 


All these registers operate in the same manner and are used to retrieve a stuck thread or to single 
step through primary core code. Setting the bit causes the relevant thread to single step 1 clock 
ignoring interlocks (including waits on memory or co-processor) as well as terminating repeat 
instructions. To step again the register must be set to 0 and again to 1. 


Thread Kicker 0 


Offset Ox00BO Implicit Address 0x0100 
Reset Value 0x0000 

Thread Kicker 1 

Offset 0x00B4 Implicit Address 0x0101 
Reset Value 0x0000 


Kick_T1 


Thread Kicker 2 


Offset 0x00B8 Implicit Address 0x0102 
Reset Value 0x0000 

Thread Kicker 3 

Offset 0x00BC Implicit Address 0x0103 
Reset Value 0x0000 


15-0 


Kick_T3 
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Kick_T0, Kick_T1, Kick_T2, Kick_T3 (WO) 


All these registers operate in the same manner. Writes increment the internal primary core kicker 
register for the relevant thread. The low 15 bits are added to the internal register value and the top bit 
is ORed into the register (this top bit may be used to signal an asynchronous event allowing the 
primary core to process a high priority task). 


Primary Core Reset Vector Low 
Offset 0x0040 Implicit Address 0x0010 
Reset Value OxFFF8 
15-0 
RST_VECTORLO 
Primary Core Reset Vector High 
Offset 0x0044 Implicit Address 0x0011 
Reset Value 0x0403 
15-0 
RST_VECCTORHI 


RST_VECTORLO, RST_VECTORHI (RW) 


These two registers are the Primary Core Reset Address. They are loaded into Thread 0’s program 
counter after a hard or soft reset of the primary core. They should be programmed to (reset address-4) 
i.e. a pre decremented address. 


The reset address of 0x0403FFF8 causes thread 0 to vector to the top of ROM space on a hard reset 
fetching the last (4 byte) instruction in the ROM image. (see ROM Offset) 


Only the top 30 bits of the reset vector are used hence the address is DWORD aligned. 
Primary Core Soft Interrupt 


Offset 0x0048 Implicit Address 0x0012 
Reset Value 0x0000 


15-4 ee eae eee ee 
Reserved C1IRQ3 C1IRQ2 C1IRQ1 C1IRQO 


C1IRQ3, C1IRQ2, C1IRQ1, C1IRQO (RW) 


These registers allow the primary core to cause an interrupt to the host (and as a side effect, the host 
to cause interrupts from PowerVR 250). In common with all interrupting sources into the interrupt 
register, they trigger an interrupt on a rising edge, hence must be cleared and then set to generate an 
interrupt. 


Primary Core Video Clock Div 
Offset 0x004C Implicit Address 0x0013 
Reset Value 0x0000 


in=2 1-0 


crvib DIV 


C1VID_DIV (RW) 


Primary core Video In clock divider register. This allows the video reference clock from the video 
encoder to be divided down and used as a timing reference for the primary core. 
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If the TVOut module is fed with a 54MHz clock, the input to the divider is 27MHz. 


00b Divide input by 1 
01b Divide input by 2 
10b Divide input by 3 
11b Divide input by 4 


Since boards designed without TVOut may use 14.3 MHz as the reference clock (i.e. 7MHz into the 
divider) then it is recommended that for a 54MHz reference clock a divide ratio of 4 is used to provide 
an equivalent timer reference. 


The reference timer (post divider) must run at less than half the core clock frequency for correct 
operation. Hence if the CoreClock is reduced for power management then the operation of this timer 
cannot be guaranteed. 


The output of the divider may be read from one of two primary core registers in the timer block. The 
first is a 32bit register and the second a 24bit register (genlock register). Both are reset using the reset 
signal used by the TVOut block hence neither timer will work if the TVOut is held in reset. 


2D CORE 


The 2D core performs 2D pixel operations including BLTs, color space conversion, motion 
compensation, 2D blend operations and ROPs. The host interfaces to it through a memory based 
circular command buffer (CCB) in which commands are written and then processing is instigated by 
the host writing to the primary core thread 0 kicker register. 


The host interface is described in [9]. 
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3D CORE 


The PowerVR 250 is a deferred rendering device using tile based display lists. The render surface is 
broken down into 32x16 tiles (or regions) each one having its own display list containing the data for 
the objects within that tile. Therefore, before a render can start, the object data for the whole scene 
must be ‘tiled’, i.e. a bounding box for each object must be calculated and then the object inserted into 
the display lists for the tiles that overlap the box. This task is performed by the tile accelerator. To 
achieve some of the more complex visual effects seen in 3D scenes, the 3D core also supports 
multiple passes on each tile. 


The normal steps to rendering 3D scenes are: 


Host software (application and driver) creates polygon data, creates and loads associated textures 
and prepares the scene data for the Tile Accelerator and 3D core to process. 


The Tile Accelerator read the 3D parameters and creates a tiled representation of them in 
framebuffer memory. 


The ISP reads each tile in turn. For each one it reads the 3D object geometry data and performs 
hidden surface removal (HSR). 


The ISP passes tags to the TSP for all visible parts of polygons. The TSP reads the 3D object data 
and associated textures and textures and blends the resultant pixels. Rendered pixels are stored in an 
accumulation buffer to allow arbitrary numbers of layers to be added to objects before the final pixels 
are written out to framebuffer. 


Tile Accelerator 


The tile accelerator supports two modes of operation. Copy Through Mode is the default mode which 
copies the entire scene object data into frame buffer memory and hence requires a relatively large 
buffer. If a buffer of required size is unavailable then Full Macro Tiling mode is used where the scene 
is first partitioned into macro tiles before being processed by the tile accelerator. 


Copy Through Mode 


In Copy Through mode the TA reads a control stream consisting of ISP/TSP control words followed by 
object pointers for all objects that share those words. The TA reads the raw object data the pointers 
reference from Host memory, applies small object culling, scaling the X and Y co-ordinates if super 
sampling is enabled and inserts the ISP and TSP control words and writes the resulting object data 
into frame buffer. 


Full Macro Tiling Mode 


A macro tile is a tile that is composed of an array of hardware tiles (32x16 pixels). The scene is divided 
into macro tiles which are processed by the TA and rendered in turn. This minimises the framebuffer 
memory requirements for parameters, but otherwise operates in exactly the same manner as Copy 
Through Mode. 
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Host Buffers 


Object 
Pointers & 

Control | E 

Words Object 
Data 
I Tile Accelerator —aee 
I 
| 

Object | 

Data I ; 
Frame Buffer 


TA 
Instructions; Status 


Object 
Scene/Macro Bo 
tile control Primary Core 
blocks 
|---| 
Region 
Headers 


Table 6. Tile Accelerator Data Processing 


TA / Primary Core Interface 


The Tile Accelerator and Primary Core interact in two main ways. Firstly the Primary Core is 
responsible for setting up and starting the Tile Accelerator by writing instruction registers to it and 
secondly the primary core is used to generate the region header and object pointers in frame buffer 
memory based on data output from the Tile Accelerator. 


The Tile accelerator / primary core interface is detailed in [8] 


The Tile Accelerator instruction registers are only accessible by the primary core writing to it’s Tile 
Accelerator port. The address of each register is encoded in the 64-bits of instruction sent. 


TA Instruction 0a 


Reset Value 0x0000 
63 - 52 51-32 31-29 28 - 20 
TA_CTRL_PAGE_ADDRY 000 (ad TA_CTRL_PAGE_ADDRO 


TA Instruction 0b 


Reset Value 0x0000 
63 - 52 51-32 31-29 28 - 20 
TA_CTRL_PAGE_ADDR3 001 (Addr) Reserved TA_CTRL_PAGE_ADDR2 


TA Instruction 0 


Reset Value 0x0000 
62-32 31-29 28-12 


ele 
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TA Instruction 1 


Reset Value 0x0000 


s-32 | s1-30 | 29-21 


TA_COPY_DEST_ADDR Reserved Reserved TA_COPY_COUNT 


TA Instruction 2 


Reset Value 0x0000 


63 - 62 61-56 55 — 54 53 - 48 46 - 40 38 - 32 


MTILE_MINX MTILE_MAXX MTILE_MINY MTILE_MAXY 


31-30 29-24 22-16 
10 (Ade TA_SOREEN_W TA_SCREEN.H 


Lo | es a | ee ne 
TA_AA_Y | TA_SHAD | TA_COPY TA_CULL | TA_HALFCULL | TA_AA_X | TA_MTILE_OLAP 


TA Instruction 3 


Reset Value 0x0000 
63 - 62 61-56 55-54 53-48 46 — 40 38 — 32 
TA cuP| —CLIP_MINX CLIP_MAXX CLIP_MINY CLIP_MAXY 


31-30 29-0 
14 (Adar) 


TA_CTRL_PAGE_ADDR 0 - 3 (WO) 


These registers provide the address of the four physical pages in host memory which contain the Tile 
Accelerator Control List. 


TA_CTRLWORD_COUNT (WO) 


This is the number of DWORDS in the Tile Accelerator Control List. When started, the Tile Accelerator 
runs through this list for up to the maximum 4k DWORD entries. If more entries need to be processed 
by the Tile Accelerator, these must be done in a separate pass (though they can be added to the 
same scene for rendering purposes). 


TA_COPY_DEST_ADDR (WO) 


This is the base address at which the Tile Accelerator starts to copy objects. It is a 30 bit DWORD 
aligned address allowing full 4GB addressability. 


TA_COPY_COUNT (WO) 


This is the maximum number of object DWORDS the Tile Accelerator may write i.e. it represents the 
maximum amount of memory that has been allocated for object data. If the TA reaches this limit during 
tiling, it signals an overflow back to the Primary Core in the TA Status word (see below). It’s 21bits 
represent a maximum object space limit of 8MB. 
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TA_SCREEN_W, TA_SCREEN_H (WO) 


Sets the screen width and height in number of tiles. A tile is a fixed size of 32x16 pixels in PowerVR 
250 hence the maximum width is (32 x 64) = 2048 and a maximum height of (16 x 128) = 2048. 


MTILE_MINX, MTILE_MAXX, MTILE_MINY, MTILE_MAXY (WO) 
Sets the macro tile region for the TA. This can range from a single tile up to the full screen. 
TA_COPY (WO) 


Determines whether the Tile Accelerator will copy object data into the framebuffer or not. 


0 no copy through 
1 copy through 
TA_Z (WO) 
Determines whether the objects being processed have Z and 1/W defined in their vertices, or just 1/W. 
0 1/W only 
1 Z& 1/W 


TA_CULL (WO) 


Determines whether the TA performs small object culling or not. Small object culling removes objects 
which do not cross the centre of a pixel i.e. they should not light any pixels at all. 


0 Small object culling off 
1 Small object culling on 


TA_HALFCULL (WO) 


Alters the definition of the pixel centre from the co-ordinate intersection — X and Y co-ordinates are 
both integers to the point where the X and Y co-ordinates both have a fractional part of 0.5. 


0 Pixel centre at integer co-ordinates 
4 Pixel centre at 0.5 fractional co-ordinates. 


TA_SHAD (WO) 


Enables cheap shadow support in the Tile Accelerator. 
0 Cheap shadow support enabled 
1 Cheap shadow mode disabled 
TA_AA_X, TA_AA_Y (WO) 


Turns full scene anti-aliasing support on or off in the TA. When on, the TA will create a super-sampled 
screen representation in frame buffer by scaling all the input co-ordinates up by 2 in X or Y 
respectively. 


0 Anti-aliasing support in X/Y off 
1 Anti-aliasing support in X/Y on 


TA_MTILE_OLAP (WO) 


Turns on Macro tile overlap detection. This is broken and should not be used. 


0 Macro tile overlap off 
1 Macro tile overlap on 
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Controls the TA’s tile aligned clipping. This can be used to create a display list for a region of the 
screen clipped to a tile aligned bounding box. The clipping is performed after the tiling operation, 
immediately prior to writing object data. 


00b No clipping 

01b All clipping — clips all output 

10b All outside clipped — clips all tiles outside the defined region 
11b All inside clipped — clips all tiles within the defined region 


CLIP_MINX, CLIP_MAXX, CLIP_MINY, CLIP_MAXY (WO) 


Defines the tile clip region. 
Note: Sending Instruction 0 to the TA implicitly starts the tiling process. 
TA Status 0 


62-92 
| oo | TA_OBJECT_PTR 


29-13 
TacoL | TACO TALTILENUM 


A TA Status 0 word is sent from the TA to the Primary Core for every object processed. 
TA_OBJECT_PTR (RO) 
The TA Object pointer is returned in the 3D internal data format (see Object Pointers). 


TA_TILENUM (RO) 


This is the tile number in which the object resides. It is a zero based number relative to the top left of 
the macro tile being processed. 


HW 
Tile 


Macro Tile 


Table 7. Tile Numbering in a Macro Tile 


TA_EOL (RO) 


End of List indicator. Once set, the Tile Accelerator has completed processing the display list. 
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TA_CO (RO) 


Copy through Overflow. This bit is set if the TALCOPY_COUNT limit is reached. When set, the 
TA_EOL bit is also set. 


TA Status 1 


63 62 — 32 


TA_OLAP_OBJECT_PTR 


| a1 | ao | we | 5 | « | s [| 2 fa [To | 
tat | ta.po | taro | Tago | TauR | TAUB 


A TA Status 1 word is used in TA Macro Tiling mode. This is not used in PowerVR 250. 


~~ TA Data Format 


The TA_CTRL_PAGE_ADDRO — 3 pointers address a TA control list in host memory which in turn 
points to vertex data as illustrated below. 


[1] Object Ptr 


TA Control List 
(up to 4 physical 
pages) 


—> 
Next Page 
ISP / TSP Hdrs 
aa 


(3/5 DWORDS) 


Object Ptr 
Object Ptr 
Object Ptr 


Lp» 


Vertex Data 


Object Ptr 
Object Ptr 
Current Page 
Next Page 


ISP / TSP Hdrs 
(3/5 DWORDS) 


Object Ptr 
Object Ptr 
Object Ptr 


Table 8. Tile Accelerator Input Data 


TA Control List format 


The control list buffer contains pointers to the objects in the scene and some control data. 


The object pointers are in the same format as used by the ISP and TSP. The top bit is used to tell the 
Tile Accelerator the current and next physical page addresses in which object (vertex) data is located 
as well as the ISP and TSP control words to be used for that data and hence is set at every 3D state 
change or when object data crosses page boundaries. 


Note: The escape pointer block (an object pointer with top bit set, page pointers and ISP/TSP control 
words) must not cross a physical page boundary. 
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Internal 3D Data Format 


The internal 3D data format is processed directly by the ISP and TSP and represents the entire scene 
in a tiled format where each tile (32x16 pixels in PowerVR 250) is referenced by a region header as 
shown below. 


Region Header Array Object Pointers Objects 


Region Header 


> Object Pointers 


Region Header 


Lat 
Region Header > 
-—>} 


Region Header — 
> Object Pointers +} 


Objects 


Region Header 


—p Object Pointers -_———$ 


Table 9. Internal 3D data format 
Region Header 


The Region Header Array is an array of Region Header Blocks in a contiguous area of memory which 
represent all the objects in a scene. It’s address is defined by the REGION HEADER BASE and 
REGION HEADER OFFSET registers. 


One (or more) Region Header blocks are defined for each screen tile and consists of five 32 bit words, 
containing a header word and 4 pointers as follows:- 


Region State Buffer Position Position 


Translucent List Pointer 
Translucent Modifier Volume List Pointer 


Last Region 


This marks the end of the rendering region data. It must be set on the last region. 


Keep Z State 


If this bit is cleared the state of the Z buffer is reset by the insertion of a background plane before any 
other planes are processed. If set, this plane is not inserted and the Z state of the ISP is not cleared. 
This allows the ISP to perform multiple passes on the same tile whilst keeping the Z state from 
previous passes. 


PreSort 


When cleared, translucency auto sorting is used and when set it is not, requiring that the driver 
provides pre sorted translucent objects. 
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Flush Acc Buffer 


The accumulation buffer in the TSP is normally flushed between the processing of regions. This is 
done with an End-Of-Tile flag sent by the ISP to the TSP. In some cases, it may be desirable to 
perform an extra pass on objects in the same region and maintain the state of the accumulation buffer 
while doing so. This can be achieved by clearing this bit which gates the End-Of-Tile flag. If the bit is 
set the accumulation buffer is flushed as normal. 


Region Y & X Position 


This specifies the top left pixel position of the region. The supplied Y and X values are multiplied by 16 
and 32 respectively. 


List Pointers 


The format of each of the four list pointers is as follows: 


Empty PTR Pointer to Object List | 00s 


The Pointer To Object List field is the address of the relevant (opaque, translucent etc.) object 
pointer list which is aligned on a 32bit word boundary. The pointer and 3D Object Pointer Offset 
register together address a 4GB range. An empty list is identified by a ‘1’ in the Empty PTR field in 
which case all other bits in the list pointer are ignored. 


Object Pointers 


An array of object pointers describe objects potentially visible in a particular tile. The array is in pages 
of memory linked by a special Object Pointer type. 


[| Mask Shadow Skip Triangle Strip Start 


The 6 bit mask allows any combination of triangles in a strip to be added to a region. For example if 
‘TO’ and ‘T3’ are set, then the first and fourth triangles in the strip are visible in that region. 


Mask 


Shadow 


The Shadow bit communicates that the surface has inside/outside information attached. In ‘Intensity 
Volume Mode’, this bit describes whether the surface accepts a shadow. 

Skip 

The Skip field communicates the size of a vertex, so random access into the strip is possible. 


If the Shadow bit is not set, then the size of a vertex is ‘Skip + 3’ words long. If the Shadow bit is set 
and PowerVR 250 is in ‘Parameter Selection Volume Mode’ , then the size of a vertex is ‘Skip*2 +3’ 
words long. 


Triangle Strip Start 


Address bits 22 — 2 of the byte address of the triangle strip object which must be aligned on a 32bit 
word. Bits 27 — 20 are defined in the Object Base register and bits 31 — 28 in the 3D Address Base 
register. 
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Object Pointer Block Link 


This register is used to link blocks of object pointers in separate physical pages together. It is only 
required when object pointers are stored in host memory or to terminate the list of objects. 


Bits 31-29 28 27. 
n 


2 
qt End of Next Pointer Block 
List (32bit Word Address) 


Marks the end of the object pointer list. 


End of List 


Next Pointer Block 


Bits 27 — 2 of the next block containing object pointers. Bits 31 — 28 are defined by the jError!No se 
encuentra el origen de la referencia. register. 


3D Object Data 


Illustrated below is the complete list of 3D parameter formats that supported by PowerVR 250. All 
objects may be represented by two forms, one using X, Y and 1/W to describe a vertices position and 
a second using X, Y, Z and 1/W. In the second case Z is used for depth sorting and 1/W is used for 
texturing calculations, the vertex format used is determined by the ISP_ZSORT bit in the ISP Sort 
Control register. 


In each case the format described is a triangle strip with between 3 and 8 vertices (a strip of 1 to 6 
triangles). The vertex specific data is shaded and must be repeated for each vertex. 


Shadow Triangle 


Format #1 Format #2 


ISP/TSP Instruction Word 
TSP Instruction Word (ignored) 
Texture Control Word (ignored) 


ina 


Base Color : : | omc | 


Gouraud Shaded Triangle (No Offset Color) 


F oxte | 
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Gouraud Shaded Triangle (Offset Color) 


ISP/TSP Instruction Word ISP/TSP Instruction Word 
TSP Instruction Word TSP Instruction Word 
Texture Control Word (ignored Texture Control Word (ignored) 


| | 070 


ISP/TSP Instruction Word 
TSP Instruction Word 
Texture Control Word 


ISP/TSP Instruction Word 
TSP Instruction Word 
Texture Control Word 


- 


sai 


32 bit UV Textured Gouraud Shaded Triangle (No Offset Color) 


ISP/TSP Instruction Word ISP/TSP Instruction Word 
TSP Instruction Word TSP Instruction Word 
Texture Control Word Texture Control Word 


Ls es ce ae SANG 
: — : ae 
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16 bit Textured Gouraud Shaded Triangle (Offset Color) 


: < 


ISP/TSP Instruction Word 
TSP Instruction Word 
Texture Control Word 


| 0100 | ISPITSP Instruction Word _| 
| ovo | TSP Instruction Word | 
| 

| ont | 


ISP/TSP Instruction Word 
TSP Instruction Word 
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32 bit Bump Map Triangle. 


ISP/TSP Instruction Word 
TSP Instruction Word 
Texture Control Word 


Gouraud Shaded Triangle With Two Volumes (No Offset Color) 


ISP/TSP Instruction Word ISP/TSP Instruction Word 
TSP Instruction Word 0 TSP Instruction Word 0 


Texture Control Word O(ignored) Texture Control Word O(ignored) 
TSP Instruction Word 1 TSP Instruction Word 1 
Texture Control Word 1(ignored) Texture Control Word 1(ignored) 


z 3 <= =< 
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16 bit UV Textured Gouraud Shaded Triangle With Two Volumes (No Offset Color) 


_ 


SS 


ee Base: i | we 


32 bit UV Textured Gouraud Shaded Triangle With Two Volumes (No Offset Color) 


ee 


ee 
a s 
- Le 
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16 bit Textured Gouraud Shaded Triangle With Two Volumes (Offset Color) 


TSP Instruction Word 0 
Texture Control Word 0 
, , 

| ox20_| 
| oe _| 


0x34 


ISP/TSP Instruction Word 


TSP Instruction Word 0 


Texture Control Word 0 


TSP Instruction Word 1 


oxt0 


0x14 


Texture Control Word 1 0x10 


: . Y 
VC 
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32 bit Textured Gouraud Shaded Triangle With Two Volumes (Offset Color) 


| oxo0 | ISP/TSP Instruction Word ISP/TSP Instruction Word 
TSP Instruction Word 0 TSP Instruction Word 0 


0x08 Texture Control Word 0 Texture Control Word 0 
TSP Instruction Word 1 TSP Instruction Word 1 
| oxto Texture Control Wor Texture Control Word 1 


| ont | 
| oxte | 
| 0x20 _| 
| oxza | 


0x24 


| 0x28 
| 0x26 
| 0x30 


32 bit Bump Map Triangle With Two Volumes 


ISP/TSP Instruction Word 
TSP Instruction Word 0 


ISP/TSP Instruction Word 


TSP Instruction Word 0 


Texture Control Word 0 | oxos | 


TSP Instruction Word 1 
Texture Control Word 1 


Texture Control Word 0 
TSP Instruction Word 1 
Texture Control Word 1 


0x08 


0x14 
0x18 
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Vertex Order 


The following diagram shows a triangle strip of length 4. 


The Order that the vertices are stored is ABCDEF. Triangle1 is defined by ABC, Triangle 2 is defined 
by CBD (not BCD), Triangle 3 by CDE, and Triangle 4 by EDF (not DEF). This then guarantees that 
the ordering of the points (clockwise in this example) is consistent between all triangles. 


X, Y, 1/W, Z Data 
These are described in IEEE single precision floating point format. The X and Y are the screen co- 


ordinates of the vertex, Z refers to the vertices untransformed depth value, and 1/W to the inverse of 
it's perspectively projected depth value. 


Color Data 


The colors are packed into 32 bits, with the following format: 


Green (8bits) Blue (8bits) 
MSB LSB 
Note 1: an Alpha of 0 will implies fully translucent, and an Alpha of 255 is fully opaque. 
Note 2: The Alpha component in the Offset color can be used as Vertex Fog. 
UV Texture Coordinates 
These values are described in two possible formats. If the ‘16 Bit UV’ bit in the ‘ISP/TSP Instruction 
Word’ is clear then the UV pair are in IEEE single precision floating point format. If the ‘16 Bit UV’ bit is 


set, then a UV pair is placed in one 32 bit word by taking the top 16 bits of each IEEE float. The U is 
stored in the most significant 16 bits, and the V is stored in the least significant 16 bits. 


Co-ordinates outside the range 0.0 to 0.9999 will be in the Flip/Clamp/Wrap region. 

Two Volumes format 

If the shadow bit in the pointer is set, and ‘Parameter Selection Volume Mode’ has been selected, 
then data for two volumes will be supplied. If the surface is outside the defined volume, then the 


parameters with ‘0’ appended to the name will be selected. If the surface is inside the defined volume, 
then the parameters with ‘1’ appended to the name will be selected. 


Bump Mapping format 


The K1K2K3Q parameters are taken from the third vertex (in the same manner as flat shaded 
surfaces). A complete description of Bump Mapping is described in [6]. 


ISP/TSP Instruction Word 


The ISP/TSP Instruction word is set out as follows: 
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Depth Culling Depth Texture Offset Gouraud | 16 Bit UV Cache D Calc Reserved 
Compare Mode Write shading Bypass Control 
Mode Disable 


If the surface is in one of the Modifier Volume Lists, the word is interpreted as follows: 


Bits 31-29 


Volume Culling Reserved 
instruction Mode 


Depth Compare Mode 


The depth compare mode in combination with the ‘Depth Write Disable’ field is used to support the 
various OpenGL and D3D style of Z-buffer updates. Remember that the ‘Z’ refers to either Z or inverse 
Z (or 1/W) values depending on the operating mode and the depth compare may need to be changed 
as a result. 


Depth Function Code 
Never 0 
Less 1 
Equal 2 
Less Or Equal 3 
Greater 4 
Not Equal 5 
Greater Or Equal 6 
Always vi 


Culling Mode 


To support arbitrary back-face culling, the culling mode field defines the following modes: 


Cull mode Code 
No culling 0 


| Cull if Small 


1 
Cull if Negative 2 
Cull if Positive 3 


The first is self-explanatory. The remaining three use the register, FRU CULL VALUE, which is set up 
by the software at the start of rendering. The settings have the following behaviours :- 


Action 
Cull if (|det| < FPU CULL VALUE) 


Cull if (|det| < 0) or 

(|det| < FPU CULL VALUE) 
Cull if (|det| > 0) or 
(|det| < FPU CULL VALUE) 


Cull mode 
Cull if Small 
Cull if Negative 


Cull if Positive 


This allows the ISP to eliminate very small triangles, as well as perform the usual back-face culling. 


Depth Write Disable 


If the Depth Write Disable flag is set, the depth test occurs as normal but the depth value is not 
updated even when the polygon is visible - only the ‘Tag’ register for the pixel is updated. It is used to 
implement OpenGL Depth Masking. 
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Texture Bit 


This bit specifies if the surface is textured or just shaded. 


Offset 


If set, then the color offset values are relevant, and must added into the shading calculation. If the 
Gouraud shading, it must be interpolated. If zero, the offset color can be regarded as equivalent to 
zero. 


Gouraud Shading 


If set, the three colors associated with a triangle will have perspective correct interpolated color across 
the surface. If this bit is 0, then the color of the third vertex will be used over the whole surface. 


16 bit UV 


If the ‘Texture’ bit is set and the ‘16 bit UV’ bit is set, then a UV pair is placed in one 32 bit word by 
taking the top 16 bits of each IEEE float. The U is stored in the most significant 16 bits, and the V is 
stored in the least significant 16 bits. 


Cache Bypass. 


If this bit is set, then the TSP parameter cache will be completely bypassed. 


DCalc Control. 


This is used to control the method of calculating ‘D’ (or Level Of Detail) for MIP-mapping. If the ‘Dcalc 
Ctrl’ bit is 0, then the gradient calculations are as follows:- 


ar — pc 
dudx = ———_——— 
w= (pX +q¥ try 
dr- 
Aeon — oe 
(pX +q¥ +r) 
Ailes br —qe 
” ~* (pX +q¥ +r) 
er —qf 
aay = (pX+q¥+ry 


‘a’, ‘b’, ‘c’, ‘d’, ‘e’, ‘f, ‘p’, ‘q’, and ‘r’ are the texture mapping coefficients. 


‘X’ and ‘Y’ are the screen coordinates. 
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If the ‘Dcalc Ctrl’ bit is 1, then the gradient calculations are as follows:- 


a( pX'+qY'+r) — p(aX'+bY'+c) 


dudx = : > 
(pX +qY¥ +r) 
d(pX'+qY'+r)— p(dX'+eY'+f) 
dvdx = = 
(pX +q¥ +r) 
b( pX'+qY'+r) —q(axX'+bY'+c) 
dudy = 5 
(pX +q¥ +r) 
e(pX'+qY'+r)—q(dX'+eY'+f) 
= 
(pX +q¥ +r) 


X dash and Y dash are the screen co-ordinates of the first vertex of the surface. 


Volume Instruction. 
PowerVR 250 will support inclusion and exclusion volumes constructed entirely from triangles. The 
type of these volumes must be identified on a per-region basis via a special instruction on the triangle. 


Since Depth comparisons must be consistent, the polygon type instruction information replaces the 
Depth Function bits. Since the lists of normal objects and Modifier Volume objects are separate, there 
is no ambiguity. The instruction values are :- 


Volume Instruction 


‘Normal’ Polygon 


Inclusion Last Poly 1 


Exclusion Last Poly 2 


Reserved 


The ‘Last polygon’ instructions indicate that this is the last polygon for a given volume in a region, 
while the ‘normal polygon’ is just one of the polygons forming the boundary of the ‘current’ volume. 


The culling mode bits are the same as for visible polygons, but culling anything except tiny polygons 
may be dubious. 


TSP Instruction Word 


The TSP Instruction word is set out as follows: 


Bis3120 | 2026 23-22 


SRC Alpha DST Alpha | SRC Select | DST Select | Fog Control Color Use Alpha Ignore Tex 
Instr Instr Clamp Alpha 


Bis 18:17 | Bits 16-15 


SRC / DST Alpha Instructions 


The accumulation buffer controls consist of two 3 bit fields which control the alpha blend functions, 
and a 2 bit Source/Destination Select field. Combinations of these functions should be able to support 
all useful forms of OpenGL/D3D alpha blending, plus the ability to simulate multiple textures per 
surface. 
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The 2 bit Source/Destination Select field controls whether the blending uses a secondary 
accumulation buffer which can be used for special ‘multiple texture per surface’ style effects such as 
shadow and bump maps. Such effects will be done by repeating a surface in the translucency pass 
with different textures/shading modes and using the secondary buffer. 


The blend function assumes we have two RBGA values, SRC and DST, which are combined and the 
result put back into DST, via the following function: 


DST := SRC * BlendFunction(SRC Alpha Instruction) + 
DST * BlendFunction(DST Alpha Instruction) 


where 
BlendFunction (Instruction) 


takes as input the three bit instruction along with the SRC and DST colors, and returns a vector of four 
alpha weighting values - one for each of RGBA. The instruction codes are defined as: 


Instruction Field Value | Values Returned 
—— 


Zero 
One 
‘Other’ Color 


Inverse ‘Other’ Color 
SRC Alpha 
Inverse SRC Alpha 
DST Alpha 

Inverse DST Alpha 


The ‘Other Color’ and ‘Inverse Other Color’ instructions mean that when the BlendFunction is 
multiplying by SRC, the color of the DST is used and vice versa. 


The addition that is performed checks for possible overflow and clamps the resulting values 
appropriately. 


SRC / DST Select. 
The actual SRC and DST values are selected by the two bit SRC-DST Select field. 


If the SRC Select bit is set, then rather than using the colors output from the current surface’s 
shading/texturing computation, POWERVR 250 uses the contents of the secondary accumulation 
buffer as the source. (This is used to combine the results of multiple textures which are in the 
secondary buffer back into the primary accumulation buffer) 


If the DST Select bit is set, then rather than using the colors in the primary accumulation buffer 
shading/texturing, PowerVR 250 uses the contents of the secondary accumulation buffer as the 
Destination. This applies to all occurrences of DST in the above equation. 
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Fog Control 


The fog field indicates the following possibilities: 


Fog Mode Bit Pattern Description 


Look Up Table 00 Use a table indexed by 1/W and piece-wise linear 
interpolation to generate fog ‘alphas’. 


Per Vertex 01 Use the ‘fog alpha’ in the offset colors alpha. This 
is interpolated if ‘Gouraud’ is switched on, 

otherwise is constant. This bit pattern should only 
be set if the ‘Offset’ bit is also set, otherwise it will 


default to ‘No fog’. 


No fog 10 Disable fogging on the current surface 
Look Up Table 11 The fog density is calculated as mode 0, but 
Mode 2 instead of interpolating the surface color with the 


fog color, the fog color replaces the surface color, 
and the density is placed in the alpha channel. 


There are two global fog color registers. One is used when the depth look up table fog is used, while 
the second is used for the per-vertex fog. 


The Fog Mixing function is calculated as follows:- 


fogged pixel = (1.0-fog alpha)* pix col + fog alpha * fog col 
Ll = am = gt 


This Fog Mixing is performed just before the Alpha Blending unit (see Table 10. Texturing and 
Shading). There are some polygons that require fogging after the Alpha Blending unit. An example of 
this is a textured polygon that has a Shadow Map placed on top. A Shadow Map works by multiplying 
two textures together. One of the textures contain the surface detail (such as a wall), and the other 
texture contains a shading intensity that will darken the surface detail. The two textures are multiplied 
together by the Alpha Blending unit using the ‘Other Color’ and ‘Inverse Other Color’ instructions. If the 
fogging function is performed before the Alpha Blending, then the resulting image will look incorrect. 


To solve this using ‘Per Vertex’ fog, the two polygons are sent through the hardware without any fog, 
and a third polygon is placed on top. This third polygon is a non-textured surface with the fog color as 
the surface color, and the fog density in the alpha channel. This surface would then be alpha blended 
with the result of the blend of the first two polygons. 


To solve this with ‘Look Up Table’ fogging the mode ‘Look Up Table Mode 2’ has been provided. When 
this mode has been selected, the fog color replaces the surface color, and the calculated fog density is 
placed in the alpha channel. In a similar manner to the ‘Per vertex’ method, the two polygons are sent 

through the hardware without any fog, and a third polygon is placed on top. This third polygon has the 

‘Look Up Table Mode 2’ selected, and an alpha blend function of ‘SRC Alpha’ and ‘Inverse SRC alpha’ 
selected. As a result, the fog is interpolated with the result of the blend of the two polygons. 


Color Clamp 


This clamps the calculated color just before it is fogged. An Underflow and Overflow color register is 
programmed by the user. 


Use Alpha 


If set, then the per-point alpha bits in the color are relevant, and must be interpolated when Gouraud 
shading. If zero, then the alpha should be regarded as fully opaque (i.e., 1.0f or 255 in 8 bit fixed 
point). 


11 December 1998 74 Version Draft A 


a a 
Seni on cores  Vicle@oLogic 


Ignore Texture Alpha 


If the texture has an alpha value, this allows that alpha to be ignored. In effect, if the bit is set, the 
alpha of the texture is assumed to be fully opaque, i.e., 1.0. Note that this only refers to the alpha of 
the texture. 


U Mode & V Mode 


The U Mode and V Mode bits control overflow conditions on U & V as shown in the following table: 


U/V Mode Bit Pattern Description | 
Repeat 00 


Flip 01 


Texture repeats across a surface 
—————— | 


U & V values above 1 ‘flip’. The decimal part is 
used either as is or as (1-Value) 


Clamp U & V are clamped at 0 and 1. 


U & V are interpolated for the shortest distance. 
E.g. a surface with U of 0.9 to 0.1 gets 
interpolated from 0.9 upto 1.1 (a distance of 0.2) 
and not from 0.9 down to 0.1 


Filter Mode 


This field determines the type of basic texture filtering that will be applied as follows: 


Field Values Filter Mode 
0 


Point Sampled 


| 1 Bilinear Filter 


PassAB 


This field determines which pass (if multi-pass texture filtering is enabled — see the MultiPass Texture 
Filtering bit in the Texture Control Word) is currently being used. If it is Pass A then the texels are 
multiplied by the fractional (1-D), if it is Pass B they are multiplied by fractional D. 


e.g. Tri-linear is implemented as a two pass system, and the driver should ‘repeat’ the object with the 
extra pass in the translucency phase. Tri-linear is only relevant if the texture is MIP mapped, however 
the other modes will also work with non MIP mapped textures. In either case, if the texture is MIP 
mapped, the system will choose the appropriate MIP map level. 


Super-Sample texture 


Setting this bit improves the quality of the texture filtering, by performing super-sampling with the 
chosen filter mode. When MIP mapping, the hardware automatically choose the next higher resolution 
MIP map level. 


MIP-map D Adjust 


As the MIP map ‘D’ value will be calculated entirely by hardware, it may be necessary to add a ‘tweak’ 
value to forcibly make it larger or smaller to trade-off aliasing vs. blurring. The adjust value will be 
multiplied by the ‘default’ D value computed by the hardware. 
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Example ‘D’ Adjust bit pattern | Equivalent value 
00.00 Illegal 
00.01 0.25 
01.00 

11.11 


Texture/Shading Instruction 


This field determines how the interpolated shading values (Base Color and alpha and Offset color) are 


combined with the texture’s alpha and texture. 


Description 


Instruction Field Value 
Decal 0 


Modulate 1 


Decal Alpha 


PowerVR 250 
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PIXaep 
PIX, 


TEXaca + OFFSET rea 
TEX, 


Texture color multiplied by shading color. 
Texture alpha replaces shading alpha. 


PIXaca = COLacs * TEXpos + OFFSETacs 
PIX, = TEX, 


ae 


Texture color blended with shading color 
according to texture alpha. 


PIXacns = (TEXpcp * TEXa) + 
(COLpgn *(1- TEXa) ) + 
OFFSET rep 

PIX, = COLa 


Modulate Alpha 


Texture color multiplied by shading color. 
Texture alpha multiplied by shading alpha. 


PI Xpcp = COLrges * TEXpgeop = OFFSET xzcp 
PIX, = COL, * TEX, 


NOTE: This instruction is ignored if the surface is non-textured. 
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U Size 


This field gives the U (or X dimension) of the texture (unless the texture is ‘Scan Order’ and the ‘Stride 
Select’ field is set). The stored values are decoded in the obvious fashion as follows:- 


Field Value | Dimension 


V Size 


This field gives the V (or Y dimension) of the texture. It is encoded in exactly the same manner as the 
U Size field. 


NOTE: This field is ignored if the MIP-mapped bit is set. 


Texture Control Word 


Bit 31 29.27 24.23 


MIP Mapped VQ Pixel Format] Scan Order |Stride Select} Color Key | MultiPass | Palettised Texture 
Compressed Control Texture Texture Address 
Filtering 


This word is only used when the Texture bit is set in the ISP/TSP Instruction Word. 


MIP mapped 


This indicates if the texture is MIP mapped. 


VQ Compressed 


This flag indicates that the texture uses 2x2 vector compression. Each 2x2 quartet of pixels is 
represented by an eight bit code which then references a 256 entry code book [7] 
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Pixel Format 


This field determines the format of the pixels in the texture. These are defined as follows :- 


Pixel Format Hex Code Description 
1555 0 One bit alpha, 555 RGB. 
Alpha can be ignored. 
565 1 Opaque 565 
4444 2 | 4 bit alpha | 
YUV422 3 2Pixels per 32 bits. : 
Bump Map 4 MSB is S, LSB is R. 
5 This mode can only be used with point sampled / 


Non MIP mapped textures 


Reserved 


This mode can only be used with point sampled / 
Non MIP mapped textures 


Scan Order 


If set, then the texture is in a non-twiddled format as described in Appendix A. Texture Formats.. 
NOTE 1: The MIP-mapping bit will be ignored if the Scan Order bit is set. 


NOTE 2: There can be a performance penalty using this format because of the increased number of 
page breaks in the texture memory. 


Stride Select 


This flag is only relevant if the texture is in Scan Order. If set it indicates that the U (i.e. X) size of the 
texture is determined by a register setting (and the U-Size parameter is ignored). The value in that 
register is multiplied by 32. 


Color Key Control 


These bits control the color key test performed on the texture as shown below 


Enable 0 
Disable 1 


Use Color Key register 1 


Use Color Key register 2 


Color Keying can be performed on any texture format. 


Multipass texture filtering 


If this bit is set, then the hardware supports multipass texture filtering in order to achieve either linear 
interpolation between MIP Maps or tri-linear filtering (see Texture mode and PassAB bits in TSP 
Instruction Word). 


Palletised Texture 


When set in conjunction with the VQ bit, this bit determines that the texture is an 8bpp palletised 
texture. PowerVR 250 supports palletised textures by treating them as VQ textures. The palette is 
stored in the same manner as a VQ codebook and the palette index in the same manner as a VQ 
index. 
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The format that the palette stores is defined by the texture Pixel Format described above, allowing a 
palletised texture to decode into 1555, 565, 4444, or 8888 formats. 


Texture Address 


This field is bits 25 — 5 of the address of the base of the texture map. It is a 256bit aligned address 
with 64MByte range. Texture reads may be issued from one of two read address generators. The first 
is used only for VQ and palletised texture index fetches and the second is used for all other texture 
fetches. In both cases the full address for the base of the texture is the determined by the Texture 
Address and the Texture Address Registers. 


Fogging using the Look Up Table 


The user can supply fog densities at 128 depths, and the hardware will linear interpolate between 
adjacent densities. There is no restriction on the fog densities that are supplied, but the depths are 
chosen through an equation. Note: PowerVR 250 uses 1/W as the index into the fog table. 


Mapping a depth to a Fog Table entry 


1) The 1/W is scaled 
Table entry 0 is placed at a 1/W value of 1.0, and table entry 127 is placed at a 1/W value of 
256.0. The user programs a register with a floating point scalar which moves the 1/W values 
supplied in the triangle vertices into the fog table range. 


2) The 1/W is clamped 
The 1/W value is clamped to 1.0 and 255.9999. 


3) The 1/W is passed through a Log, function 
An approximation to a Log, function is performed by picking some bits out of the floating point 
representation of the 1/W value. 


The Look Up Table index is constructed as follows: 


Least significant 3 bits of 1(V Most significant 4 bits of 1AW mantissa (ignoring sign and 1.0 bits 
exponent 


Mapping a Fog Table entry to a depth 


This is the reverse of the above process, and can be represented as follows:- 
Depth =(pow(2.0 , Index>>4) * ((Index & OxF)+16) / 16.0)) / FogScale; 


Fog Implementation 


The fog scale register is called FOG_DENSITY, and the format is as follows:- 


Bits 15-6 
A mantissa with the 1.0 bit in location 15 A twos complement base 2 exponent 


For example, a register value of OxFFO7 would mean a floating point value of 255.0. This value would 
scale the fog table into the range 1.0/255.0 — 1.0. 


The fog table consists of 128 16bit locations. Each location contains two 8 bit fog densities. The most 
significant byte is the fog density used if the 1/W value is exactly at the depth of the table entry. The 
least significant byte is the fog density that is interpolated with when the 1/W is larger that the depth of 
the table entry (this should be the next fog table entry). 


Inclusion and Exclusion Volumes 


Inclusion and Exclusion Volumes allow the texturing and shading parameters of a polygon to be 
altered depending on whether the polygon is inside or outside a defined volume. Volumes can be used 
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for effects such as shadows, torch lights etc. There are two modes that PowerVR 250 can operate in. 
The two modes are ‘Parameter Selection Volume Mode’ and ‘Intensity Volume Mode’. 


In Parameter Selection Volume Mode, If the user would like a polygon to be affected by a volume, then 
one of the ‘Two Volumes’ object data types is selected, and the ‘Shadow Bit’ in the object pointer is 
set. A ‘Two Volumes’ object data type has two sets of surface parameters ( 0 and 1). If the polygon is 
inside the defined volume, then surface parameters ‘1’ are selected, otherwise surface parameters ‘0’ 
are selected. 


Intensity Volume Mode was designed to eliminate the additional memory requirement of volumes, with 
a reduction in functionality. Instead of selecting between two sets of surface parameters, the Base 
Color and Offset Color is scaled by an 8 bit register if the polygon is inside the volume. If the user 
wishes a surface to be affected by a volume, then the ‘Shadow Bit’ in the object pointer is set. The 8 
bit scale register is called FRU_SHAD_SCALE. 


An ‘Inclusion Volume’ defines a volume, and an ‘Exclusion Volume’ subtracts a volume out of an 
Inclusion Volume. 


Defining a Volume 


A volume is defined by a set of ‘Shadow Triangles’ that are added to the ‘Opaque Modifier Volume 
List’ or the ‘Translucent Modifier Volume List’. The volume the triangles define must be closed (no 
holes), but do not have to be convex. The last triangle in a volume must have the ‘Inclusion Last Poly’ 
or ‘Exclusion Last Poly’ value in the ‘Volume Instruction’ field. 


Processing a Volume 


For simplicity, we will discuss volume processing with opaque objects. For each pixel on the screen, 
PowerVR 250 will first find the closest visible polygon to the eye, then works out if this polygon is 
inside the volume. It does this by casting a ray from the eye to the opaque polygon, and counts the 
number of times the ray enters or exits the volume. If the number of crossings is odd, then the opaque 
polygon is inside the volume, otherwise the polygon is outside the volume. 


If the user would like to allow a volume to affect a translucent object, then the volume has to be added 
to the ‘Translucent Modifier Volume List’. This list will be processed once for each pass through the 
translucent list. 
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Overview of Shading/Texturing Functionality 


The following block diagram shows the functionality of the Shading and Texturing modules of 
PowerVR 250 . 


Texture Fetch Base Colour Offset Colour 


({alpha], colour) [Const or Interpolated] [Const or Interpolated] 
| 
| | 
_ a. 
Texture/Shading 
Instruction 


Colour Clamp 

MIN/MAX 

Colour Clamp _—— 
Select 


‘Fog’ Alpha 


Colour+ Alpha Colour+ Alpha Colour 


Texture/Shading Combine, and Bump Map Unit 
Interpolated Fog Density 


Depth based Fog 
Density 


Alpha Colour 


Colour Clamp 


Depth Fog 

: |Colour Reg. 
Vertex Fog 
: }Colour Reg. 


Fog Colour 


Fog density 


Fog Mixer 


Colour 


[Accumulation Buffer 
SRC & DST Select 


‘Alpha Blending’ 
: Unit 


Accumulation Buffer 
Instruction Bits | 


! : __] \To Framestore 


SRC Select Mux 


SRC 
Colour + alpha) 


Alpha Blending Unit 


Table 10. Texturing and Shading 
Pixel Sampling 
PowerVR 250 is OpenGL compliant with regard to the sampling of edges of polygons - i.e., it follows 


a certain set of rules. These rules state that for any two polygons sharing the same edge (as in the 
diagram below), there must be no gaps between pixels and no pixel should be drawn twice. 


=... Shared Edge 


Specifying that no pixel is drawn twice is important for drawing order-independence so that pixels 
won't ‘flash’ along the edge, and, more importantly for inclusion/exclusion volumes, so that the 
inside/outside processing doesn’t get confused. 
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Translucency Pass Sorting 


Although standard Z buffers give per-pixel depth sorting of opaque objects, to correctly handle 
translucent objects, the software must revert to sorting the translucent faces into back to front order 
(such as in Newell, Newell, and Sancha’s algorithm). 


PowerVR 250 has a mode to do per-pixel sorting of translucency. Overlapping faces will automatically 
be sorted into back to front order. For compatibility with OpenGL, this mode can also be disabled via a 
register setting (ISP_FEED_CFG). The algorithms for these modes are discussed below. 


Automatic Translucency Sorting Mode 


The automatic sorting will use the ‘Depth Compare Mode’ for that object. The hardware will find the 
furthest away translucent object on the first pass (rejecting objects obscured by opaque objects). In 
the next pass, the hardware will find the next furthest away translucent object and this will continue 
until there is no more translucent objects . 


If two translucent objects have exactly the same depth, they are rendered in the order that they are 
sent to the Tile Accelerator. 


‘Depth buffer’ Translucency Mode 


This mode is included to support arbitrary depth compare and depth buffer enable/disable settings. For 
all intents and purposes, it simulates the behaviour of a traditional Z-Buffered system. NOTE: The 
‘Translucent Modifier Volume List’ will be ignored in this mode. This is because the hardware would 
have to process this list for each translucent polygon. 
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3D Registers 
3D Soft Reset 


Index 0x0008 
Reset Value 0x0000 


15-2 


a er 


PVR_RST 


PVR_RST (RW) 


Setting this bit puts the 3D block into reset. 


Start Render 
Index 0x0014 
Reset Value 0x0000 


START_RENDER 


START_RENDER (WO) 


Any write to this register causes 3D rendering to commence. 


Region Header Base 


Index 0x002C 
Reset Value 0x0000 


31-28 
REGIONHOR_ BASE | 00> 


REGIONHDR_BASE 


Bits 27 — 2 of the region header base address. Together with address remapping in the arbiter it is 
used to locate the base of the region header array by the hardware anywhere in the 4GB address 


space on a 32bit word boundary. 


Object Base 
Index 0x0020 
Reset Value 0x0000 


31 - 28 27 - 20 
OB_PARAM_BASE 


OBJECT_PARAM_BASE (RW) 


19-0 
0000 


Bits 27 — 20 of the base address for 3D objects. Together with address remapping in the arbiter 
objects may be positioned anywhere in the 4GB address space with 1MB alignment for the start 


address for objects. 
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Texture Address Registers 


Codebook AGP Offset 
Index 0x0060 
Reset Value 0x0000 
28-0 


CBOOK_AGP_OFFSET 


CBOOK_AGP_OFFSET (RW) 

Bits 31 — 3 of the physical base of GART. This allows the placement of the GART (for codebook / 
normal texture fetches) to be anywhere in the 4GB address space aligned on a 64bit boundary. 
Codebook FB Offset 


Index 0x0064 
Reset Value 0x0000 


31-29 
CBOOK_FB_OFFSET 


CBOOK_FB_OFFSET (RW) 


Bits 31 — 3 of the physical base of framebuffer. This allows the placement of the framebuffer (for 
codebook / normal texture fetches) to be anywhere in the 4GB address space aligned on a 64bit 
boundary. 


Codebook Ref Address 
Index 0x005C 
Reset Value 0x0000 


15-10 ee ee 
BOOK REF_ADDR 


CBOOK_REF_ADDR (RW) 


Reference address 


If this value is less than the top 10 bits of the texture address used for a code book access then the 
code book accessed is made at the following address, 


Addr = (TextAddr - RefAddr) + CBOOK_AGP_OFFSET. 
Otherwise it is made at address: 
Addr = TextAddr + CBOOK_FB_OFFSET. 


This operation is addition to the GART boundary and base address. It is used to allow texture 
addresses to overflow from framebuffer to GART. 
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Index AGP Offset 


Index 0x0050 
Reset Value 0x0000 


28-0 
INDEX_AGP_OFFSET 


INDEX_AGP_OFFSET (RW) 


Bits 31 — 3 of the physical base of GART. This allows the placement of the GART (for VQ or palletised 
index texture fetches) to be anywhere in the 4GB address space aligned on a 64bit boundary. 


Index FB Offset 


Index 0x0054 
Reset Value 0x0000 


31-29 


INDEX_FB_OFFSET 


INDEX_FB_OFFSET (RW) 


Bits 31 — 3 of the physical base of framebuffer. This allows the placement of the framebuffer (for VQ or 
palletised index texture fetches) to be anywhere in the 4GB address space aligned on a 64bit 
boundary. 


Index Ref Address 
Index 0x0040 
Reset Value 0x0000 


15-10 ee 


INDEX_REF_ADDR 


INDEX_REF_ADDR (RW) 


Reference address 


Use is a as per CBOOK_REF_ADDR, but only affects Index (VQ or palletised) fetches from the 
texture addressing unit. 


Span Sort Control 


Index 0x0030 
Reset Value 0x0000 


31-17 | te | to | ls | es | 
Cache_Bypass Offset_Sort_En Span_SorEn 
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Span_Sort_En, Offset_Sort_En 


Offset_Sort_En and Span_Sort_En are used together as follows: 
Offset_Sort_En : Span_Sort_En 


00 No sorting 

01 Groups together run length encoded spans that have 
the same tag AND the same strip offset 

10 Illegal 

11 Stage 1, groups run length encoded spans with the 


same tag, IGNORING the strip offset; Stage 2 
magnitude sorts elements of a group based upon the 
strip offset 


Mode “11” is the recommended mode of operation. Grouping minimises the TSP pre-calculation, and 
offset sorting allows the TSP parameter fetch engine to use strip optimisations, reducing bandwidth. 


Cache_Bypass 


When set Cache_Bypass bypasses the TSP parameter cache. Normally set to 0. 
TSP Latency Count 

Index 0x0044 

Reset Value 0x0000 


15-4 3-0 


Reserved LAT_CNT 


LAT_COUNT (RW) 
A performance tweaking register. It holds off texture requests by between 0 & 7 clocks. 
3D Write Control 1 


Index 0x0048 
Reset Value 0x0000 


31-24 23-16 : 
FB_ALPHA_THRESHOLD FB_KVAL FB_DITHER | FB_PACK_MODE 


FB_PACK_MODE (RW) 


Le) 
fo) 


Pixel packing mode for 3D pixel writes. 


000 555 RGB 16 bit 

001 565 RGB 16 bit 

010 4444 ARGB 16 bit 

011 1555 ARGB $16 bit 

100 888 RGB 24 bit packed 
101 0888 RGB 32bit 

110 8888 ARGB 32 bit 

111 Reserved 


FB_DITHER (RW) 


1 apply dither to pixels writing 16 bit pixels (555, 565, 4444) to frame buffer memory 
0 truncate pixels when writing 16 bit pixels to frame buffer memory 
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FB_KVAL (RW) 


Value of K when writing pixels into frame buffer. See FB_PACK_MODE. This is only valid when using 
an 8888 packing mode. 


FB_ALPHA_THRESHOLD (RW) 


Threshold for encoding 1555 mode. When pixel output is in 1555 format, if the alpha value is greater 
than FB_ALPHA_THRESHOLD then a 1 is written else a 0 is written. 


3D Write Stride 
Index 0x004C 
Reset Value 0x0000 


a 
F_W_LINESTRIDE 


bits 8:0 Frame Buffer Line Stride [Default = 0x0] 


The width of a line when writing into the frame buffer memory. This value specifies multiples of 128 bit 
words (i.e. 16 byte resolution). 


3D Write X Clipping 


Index 0x0068 
Reset Value 0x0000 


31-27 26-16 
FB_XCLIP_MAX FB_XCLIP_MIN 


The values in this register are used when writing 3D pixels into the frame buffer memory. They clip the 
left and right extents. 


FB_XCLIP_MIN (RW) 


Frame Buffer x clipping min 


Pixels with an x co-ordinate less than this value are not written to the frame buffer. 
FB_X_CLIP_MAX (RW) 

Frame Buffer x clipping max 

Pixels with an x co-ordinate greater than this value are not written to the frame buffer. 


e.g. for an X extent of 640, these registers would normally be set to 0 and 639 respectively. 
3D Write Y Clipping 


Index 0x006C 
Reset Value 0x0000 


31-27 26-16 a 
FB_YCLIP_MAX FB_YCLIP_MIN 


The values in this register are used when writing 3D pixels into the frame buffer memory. They clip the 
top and bottom extents. 


FB_YCLIP_MIN (RW) 


Frame Buffer y clipping min 
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Pixels with a y co-ordinate less than this value are not written to the frame buffer. 


FB_YCLIP_MAX (RW) 
Frame Buffer y clipping max 

Pixels with a y co-ordinate greater than this value are not written to the frame buffer. 

e.g. for an Y extent of 480, these registers would normally be set to 0 and 479 respectively. 
FPU Shadow Scale 


Index 0x0074 
Reset Value 0x0000 


pts | 
SIMPSHAD_EN SIMPSHAD_SCALE 


bits 7 - 0 Scale factor for shadows 
bit 8 Simple Shadow Enable 


This register controls the inclusion/exclusion volume mode. 


SIMPSHAD_SCALE (RW) 


Scale factor for shadows. For Intensity Volume Mode, Base Color = (scale factor)/256. The same 
scaling is applied to the Offset Color. 


SIMPSHAD_EN (RW) 


Simple Shadow Enable 
0 - Parameter Selection Volume Mode 


1 - Intensity Volume Mode 


FPU Cull Value 
Index 0x0078 
Reset Value 0x0000 


30-0 
FP_CULL_VAL (RW) 
IEEE floating point value for culling compare of the determinant. 
FPU Parameter Control 


Index 0x007C 
Reset Value 0x010477 


31-20 19-14 
TSP_BURST_THOLD ISP_BURST_THOLD FPU_PTR_BURST FPU_PTR_FIRSTBURST 


TSP_BURST_THOLD (RW) 


TSP parameter burst trigger threshold. As per ISP parameter burst trigger. 
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ISP_BURST_THOLD (RW) 


ISP parameter burst trigger threshold. Parameter requests are initiated when the space in the request 
FIFO is equal to (or greater than) this register. Lager values lead to longer bursts and fewer page 
breaks, at the expense of greater latency for other requesters to the memory interface. 


FPU_PTR_BURST (RW) 


Pointer burst size. Pointers are requested in bursts of this length. Program to the either the same or 
less than the TA_LALLOC_CTRL registers. Larger Values imply fewer page breaks. This field makes 
no difference in practise and should not be changed. 


Legal values for FPU_PTR_BURST are 0 to OxOE. OxOF is not a legal value and must not be used. 


FPU_PTR_FIRSTBURST (RW) 


pointer first burst size. Program to half the pointer burst size. 


Pixel Half Offset 


Index 0x0080 
Reset Value 0x0003 


Ee 
TSP_TEX_SAMPLE TSP_PIX_SAMPLE FPU_PIX_SAMPLE 


FPU_PIX_SAMPLE (RW) 


FPU pixel sampling position 
0 (0,0) 
1 (0.5,0.5) [Default] 


TSP_PIX_SAMPLE (RW) 


TSP pixel sampling position 
0 (0,0) 
1 (0.5,0.5) [Default] 


TSP_TEX_SAMPLE (RW) 


TSP texel sampling position 
0 (0,0) 
1 (0.5,0.5) [Default] 


FPU Perpendicular Compare 


Index 0x0084 
Reset Value 0x0000 
31 30-0 


FP_PERP_VAL 


FP_PERP_VAL (RW) 


IEEE floating point value for perpendicular triangle compare 
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ISP Background Depth 


Index 0x0088 
Reset Value 0x0000 


31-4 3-0 


ISP_BACKGND_D 


If a valid Region Header has been defined (i.e. one which has not got the Empty PTR bit set), then 
PowerVR 250 always renders an opaque object at every pixel in the 32x32 region. If the opaque 
triangles added to the region do not cover every pixel, then PowerVR 250 will render the Background 
Object at those pixels. The Background object is defined by the ISP Background Depth and Tag 
registers. 


ISP_BACKGND_D (RW) 
Background Plane depth parameter (IEEE floating point value, bits 3-0 truncated). 
ISP Background Tag 


Index 0x008C 
Reset Value 0x0000 


31-29 
BG_HINT BG_SHAD BG_SKIP BG_TAG_ADDR BG_TAG_OFF 


Background plane tag parameter 


BG_HINT (RW) 


cache hint — will this tag be used in the next tile 


BG_SHAD (RW) 
shadow parameters present — see 3D format (set to 0) 


BG_SKIP (RW) 
skip — see 3D format 


BG_TAG_ADDR (RW) 
tag address — Pointer to 3D parameters 


BG_TAG_OFF (RW) 
tag offset — Offset in strip (set to 0) 


ISP Sort Control 
Index 0x0098 
Reset Value 0x0000 


15-3 a ee a ee ee 
isp_revsort_| isp_zsort__| _Isp_pRESoRT 


ISP_PRESORT (RW) 


pre-sort mode: 
0: auto sort 
1: pre-sort 
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ISP_ZSORT (RW) 


Z sort :0 — 1/w sort 


ISP_REVSORT (RW) 
Reverse sort: 0 —LTE sort 
3D Write Control 2 


Index Ox00A4 
Reset Value 0x0000 


30-29 28-0 
Reserved | 3D_WRITE_PRIORITY 3D_WRITE_SOF_ADDR 


3D_WRITE_SOF_ADDR (RW) 


3D_WRITE_PRIORITY (RW) 


Fog Table Color 
Index 0x00BO 
Reset Value 0x0000 


31-24 23-16 
TBLFOG_COL_R TBLFOG_COL_G TBLFOG_COL_B 


TBLFOG_COL_R, TBLFOG_COL_G, TBLFOG_COL_B(RW) 


Fog Vertex Color 
Index 0x00B4 
Reset Value 0x0000 


31-24 
VTXFOG_COL_R VTXFOG_COL_G VTXFOG_COL_B 


VTXFOG_COL_R, VTXFOG_COL_G, VTXFOG_COL_B (RW) 


Fog Density 
Index 0x00B8 
Reset Value 0x0000 


FOG_DENSITY (RW) 
TSP fog multiplication value see 3D format 


Fog Clamp Max 


Index 0x00BC 
Reset Value 0x0000 


23-16 
FOG_CLAMPMAX_A FOG_CLAMPMAX_R FOG_CLAMPMAX_G FOG_CLAMPMAX_B 
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FOG_CLAMPMAX_A, FOG_CLAMPMAX_R, FOG_CLAMPMAX_G, FOG_CLAMPMAX_B (RW) 


Pre-fog clamp color - maximum value ARGB 


Fog Clamp Min 
Index 0x00CO 
Reset Value 0x0000 


FOG_CLAMPMIN_A, FOG_CLAMPMIN_R, FOG_CLAMPMIN_G, FOG_CLAMPMIN_B (RW) 


Pre-fog clamp color - minimum value ARGB 


Fog Table 
Index 0x0200 — 0x03FC 
Reset Value 0x0000 


15-0 


FOG_DATA 
FOG_DATA (RW) 
Texture Control 


Index Ox00E4 
Reset Value 0x0000 


14-8 5-0 


TEX_STRIDE 


= 
oa 


Reserved TEX_BANK_BIT 


TEX_STRIDE (RW) 


Horizontal stride in 32byte increments (min = 32, max = 2048). Only relevant when stride mode is 
selected. 


TEX_BANK_BIT (RW) 


Gives position of the bank bit for accessing the SDRAM. It is XORed with top seven bits of offset 
address into texture map. Only used during index accesses to allow toggling between banks when 
fetching Index and Codebook entries. The XOR occurs both when accessing textures in framebuffer 
memory and host memory. 


Texture Endian Control 


Index Ox00E8 
Reset Value 0x0000 


15-4 10 
Reserved TEX_CB_ENDIAN TEX_INDEX_ENDIAN 


TEX_INDEX_ENDIAN, TEX_CB_ENDIAN (RW) 


Index endian control 
Code book/normal texture endian control. 
Endian control value 


00 : No swap 
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01 : Swap 16 bit words 
10 : Swap bytes within 16 bit words 
11: Swap bytes. 


Only affects texture fetches and performs an endian swap on the data as described. 
3D Scaler Control 


Index Ox00F4 
Reset Value 0x0000 


31-17 15-12 " 10-0 
20_H2SCALE_EN 30_VISCALE_EN Reserved 


3D_VTSCALE_EN (RW) 


Vertical scaling enable 
0 - Vertical scaler disabled 
1 - Vertical scaler enabled 


3D_HZSCALE_EN (RW) 
Horizontal scaling enable 


0 - Horizontal scaler disabled 
1 - Horizontal scaler enabled 


Each scaler scales down by a fixed scale factor of 0.5 - i.e. scale down by 2. Filtering of the scaled 
down image is restricted to averaging of pairs of adjacent pixels. 


3D Color Key 1 


Index 0x0124 
Reset Value 0x0000 


31-24 23-16 
oKILR oK1_G okt 


3D Color Key 2 


Index 0x0128 
Reset Value 0x0000 


31-24 23-16 
oK2 6 


3D Diagnostic Registers 


BP Count 
Index 0x019C 
Reset Value 0x0000 


TSP_BP_COUNT 


Diagnostic register. 


19-0 | 


TSP_BP_COUNT (RW) 


Number of span groups per scene which are forced to bypass the cache because it is known that the 
surface is not present in the following tile. 


Version Draft A 93 11 December 1998 


_] _ 
VideoLogic Confidential seaagpeeraiaa tate 


Initialised to 0 on reset or start render. Register data becomes valid after end of render. 


Diagnostic1 
Index 0x01A0 
Reset Value 0x0000 


31 ~20 
VQOUT_STALL, COUNT 


VQSTALL_COUNT (RW) 


VQ module : Number of cycles per scene where VQ output is valid but is stalled 


Initialised to 0 on reset or start_render. Register data becomes valid after end_of_render. 


Diagnostic2 
Index 0x01A4 
Reset Value 0x0000 


31-20 19-0 
VQIN_STALL_COUNT 


VQIN_STALL_COUNT (RW) 

VQ module : Number of cycles per scene where input to VQ is valid but is stalled by VQ 
Initialised to O on reset or start_render. Register data becomes valid after end_of_render. 
TSP Cache Hit Count 


Index 0x0194 
Reset Value 0x0000 


31-20 
Reserved TSP_CACHE_HITS 


TSP_CACHE_HITS (RW) 

TSP Parameter Cache : Number of cache hits per scene 

Initialised to 0 on reset or start_render. Register data becomes valid after end_of_render. 
TSP Cache Miss Count 


Index 0x0198 
Reset Value 0x0000 


31-20 19-0 
TSP_CACHE_MISSES 


TSP_CACHE_MISSES (RW) 


TSP Parameter Cache : Number of cache misses per scene 


Initialised to 0 on reset or start_render. Register data becomes valid after end_of_render. 
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TSP Signature 


Index 0x0180 
Reset Value 0x0000 


VideoLogic 


FB_SIGNATURE 


FB_SIGNATURE (RW) 


Signature testing register for input to frame buffer 


ISP Signature 
Index 0x0184 
Reset Value 0x0000 


ISP_TSP_SIGNATURE 


ISP_TSP_SIGNATURE (RW) 


Signature testing register for ISP/TSP interface (lower) 
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DAC FEED 


The DAC feed takes instructions from the Primary Core, generates addresses to read a display list 
(describing the active display and it’s contents) and pixels from framebuffer, and produces red, green, 
blue and sync signals to connect to a monitor. 


Addresses to Data from 
memory memory 
Instructions from Address _ FIFO 
primary core generation 


Format 
stream 


Slave 
stream 


eee 


Digital out RGB and sync 
out 


Table 11. DAC Feed 


The DAC feed is designed to be very flexible, the display format being almost entirely controlled by a 
display list fetched from memory. The format stream is always active and can process data from 
memory at single pixel granularity with arbitrary pixel format, color space and scaling. It is also used to 
generate the syncs and blanking that form the display. 


The slave stream is used to provide color key capability and runs at programmed width, height, stride, 
and pixel format but has no YUV or scaling functionality. 


Support for both TV and LCD output in PowerVR 250 is provided by an RGB Port which can be 
connected to an external encoder such as the Rockwell Bt868/9 or panelink interface such as Sil150. 
The port may be used in either 12 bit multiplexed or 24-bit non-multiplexed mode - sharing 12 of the 
ROM address pins. The Video Out port is modal between multiplexed mode and non-multiplexed 
mode. 


Instructions to the DAC Feed take the form of spans which consist of a pointer to a display list entry 
word in framebuffer and the width, height and address of a region. A span instruction and it’s 
associated display list word can represent part of a line, a whole line, or multiple lines but only 
represents data of a single format with a single scaling factor. 
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63 — 45 44 43 42 - 40 39 38 | 37 
Reserved DAC_DUAL Reserved DAC_PIXFMT DAC_HS DAC_VS DAC_SCALE 


35-83 91-27 26-24 
DAC_CSC DAC_LUT DAC_ENDPIX 


DAC_DUAL 


Instructs the DAC Feed to fetch slave stream data as well as format stream data for this span. 


DAC_PIXFMT 


The pixel format of this span of pixels. 


000b_—s 8 bit indexed color 

001b 16 bit 555 RGB 

010b 16 bit 565 RGB 

011b 24 bit 888 RGB 

100b 32 bit x888 RGB 

101b 16 bit 422 YUV in UYVY byte sequence 
110b 24 bit 444 YUV in UYV byte sequence 
111b 16 bit 422 YUV in YVYU byte sequence 


DAC_HS 

When this bit is set the DAC Feed generates an HSYNC before this span is processed. 
DAC_VS 

When this bit is set the DAC Feed generates a VSYNC for the duration of the span. 
DAC_SCALE 

Enables the horizontal scaler for this span. 

DAC_CSC 


Enables the YUV to RGB color space converter for this span. 


DAC_LUT 


Enables the LUT for this span. If used in high color modes, the LUT provides gamma correction. If 
turned off in indexed color mode, a greyscale output is produced. 


DAC_ENDPIX 


The byte position within the last 64bit word where the last pixel of this span starts. 
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DAC_STARTPIX 


The byte position within the first 64bit word where the first pixel of this span starts. 


For example, a span of 6 pixels of 24bit RGB could be arranged in memory thus :- 


Word 0 


fv Fas fe fo PR 
lwos2 | | | | |e 
DAC_STARTPIX would be 0 
DAC_ENDPIX would be 7 


In 24 bit modes where a pixel can straddle two words, DAC_ENDPIX refers to the byte position within 
the penultimate word if the last pixel crosses a word boundary. 


DAC_CHROMA_MODE 


Controls conversion of 422 YUV to 444 YUV 
00b no conversion 
01b replicate chroma pixels 
10b interpolate chroma pixels 


DAC_PADPIX 


Number of pixels to generate from the last pixel into the scaler. This allows pixel accurate scaling up to 
x16. 


DAC_SCALE_FACTOR 


Scaling factor. This is actually an inverse factor with implied binary point before the most significant bit. 
The scaler generates N output pixels where :- 

N = (InputPixels - 1) * (INT((1/DAC_SCALE_FACTOR) << 11) >> 11) + DAC_PADPIX 
For example: 


InputPixels = 128 
DAC_SCALE_FACTOR = 10000000000b 
DAC_PADPIX = 0010b 


will produce 256 output pixels. 


DAC Feed / Primary Core Interface 


The Primary Core sends instructions to the DAC feed by writing to it's DAC Feed port. The type of 
instruction is encoded in the top two bits of the 64-bit word sent. Unused instruction encodings are 
reserved and must not be used. 


Span Instruction 


eno | ot | wos 
| oo ac sF DAC_SOL DAC_DLW_IDX 


52-42 41-32 31-26 24-0 
DACSPAN_H DACSPAN_W DAC_GETBLNK DACSPAN_ADDR 
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The Span Instruction causes the DAC feed to fetch a display list word and a span of pixels from 
framebuffer. 


When running a span instruction, the sequence of operations the DAC feed performs is :- 


If the cursor is present on this line and DAC_SOL is set, fetch cursor data for this line. 
Fetch display list word. 
Fetch pixel data. 


If the span is greater than one line (DACSPAN_H > 0), the above sequence is repeated DACSPAN_H 
times, adding the frame buffer stride to the pixel fetch address each time 


DAC_SOF (WO) 


Start of frame flag (resets Y counter). This is set in the first span instruction for a new frame and also 
implies start of line (DAC_SOL should not be set when this bit is set). 


DAC_SOL (WO) 
Start of Line flag (advances Y counter). 
DAC_DLW_IDX (WO) 


Display list words are organised in memory as 32 64bit words in a 256 byte page. This field 
determines which word in the page is read. 


DACSPAN_H (WO) 


(Height of span region — 1) in lines. These 11 bits allow up to 2048 lines to be described in a single 
span. 


DACSPAN_W (WO) 


(Width of span region — 1) In 64bit words. These 10 bits allow up to 8KB to be described in a single 
span e.g. 2k pixels at 32bpp. 


DAC_GETBLNK (WO) 


Blanking flag. When set, dummy (black) pixels are inserted instead of data being fetched from the 
framebuffer. 


DACSPAN_ADDR (WO) 


Byte address location from which to fetch the pixel span. The bottom 3 bits are ignored, so the span 
must be aligned on a 64bit boundary. The top 7 bits are defined in DAC_FB_BASE which provides a 
full 4GB address range. 


Load Register Instruction 


Reset Value 0x0000 


63 - 62 61-34 33 - 32 31-0 
| ot | Reserved __|DAC_REG_ADDR DAC_REG_DATA 


The format of the 64bit word to the DAC Feed changes depending upon the target register which is 
itself encoded in bits 33 — 32. 


DAC_REG_ADDR (WO) 
Valid register addresses are 0x01, 0x02 and 0x03. 
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DAC_REG_DATA (WO) 


Data which is specific to each register. 


DAC Register 1 


Reset Value 0x0000 


31-28 26-22 


DAC_FIELD DAC_ILACE DAC_CUR_ON | DAC_CUR_HT | DAC_CUR_BASE 


DAC Register 2 
Reset Value 0x0000 
DAC_DLW_PAGE Reserved 


DAC Register 3 


Reset Value 0x0000 
sieve | te —“<i~*rS Cd 
DAC_FB_STRIDE DAC_FB_BASE 


DAC_FIELD (WO) 


When the DAC Feed is operating in interlaced mode, this bit determines which the current field is (‘0’ is 
even and ‘1’ odd). 


DAC_ILACE (WO) 

Turns on interlace mode for hardware cursor generation. 

DAC_CUR_ON (WO) 

Turns on the hardware cursor. 

DAC_CUR_HT (WO) 

Sets the height of the hardware cursor. This is programmed with (Cursor Height — 1). 


DAC_CUR_BASE (WO) 


Bits 24 — 10 of the address in framebuffer of the cursor data. Bits 31 — 25 are defined in the 
DAC_FB_BASE field (see below). This allows the cursor data to be put anywhere in 4GB address 
space aligned on a 1KB boundary. 


DAC_DLW_PAGE (WO) 


Display list words are organised into 256byte pages. This field is the start address of a page which is 
aligned on a 256byte boundary. 


DAC_FB_STRIDE (WO) 


Sets the framebuffer stride in bytes. 


DAC_FB_BASE (WO) 


Defines the top 7 bits of the cursor data address and pixel span address. 
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DAC Feed Implicit Registers 
DAC Flow Control 


Implicit Address O0x00A0 
Reset Value 0x0000 


: re ee 
DAC_STARTSLAVE DAC_SLAVESLOTS DAC_STARTFORMAT DAC_FORMATSLOTS 


The values in this register should only be changed at mode change time when the DAC Feed is not 
fetching pixels. 


DAC_FORMATSLOTS, DAC_SLAVESLOTS (RW) 


The DAC Feed FIFO is used for buffering both slave and format stream data. These fields control the 
split of FIFO space allocated between the slave and format streams and should always be set up such 
that DAC_SLAVESLOTS + DAC_FORMATSLOTS <= 0x7F. 


DAC_STARTFORMAT, DAC_STARTSLAVE (RW) 

These bits enable data flow from the FIFO into the DAC pipeline. They are used in conjunction with the 
DAC_STARTFETCH bit to pre-fill the FIFO. 

DAC Burst Control 


Implicit Address 0x00A1 
Reset Value 0x0000 


14-10 
DAC_STARTFETCH DAC_SLAVETOK_SIZE DAC_FORMATTOK_SIZE DAC_MEMBURST_SIZE 


DAC_MEMBURST_SIZE (RW) 
The size of memory burst in 64bit words that the DAC Feed will attempt. 


DAC_FORMATTOK_SIZE, DAC_SLAVETOK_SIZE (RW) 


Token size for slave / format streams. These should be set to the same value used for 
DAC_FORMATSLOTS and DAC_SLAVESLOTS in normal operation. If they are smaller then they 
increase FIFO utilisation at the expense of memory bandwidth utilisation. They may be useful for 
tuning very high bandwidth modes. 


DAC_STARTFETCH (RW) 


Starts DAC Feed memory fetches into the FIFO. 


When starting dual stream modes, allow address fetches to start several mS before allowing pipelines 
to start. This ensures there is no data starvation and stuttering at startup which can cause 
synchronisation between streams to be delayed for several frames. 


The address dispatcher keeps a count of pending fetches for each stream which is incremented by the 
corresponding token amount whenever that many data items has been removed from the FIFO. The 
next address burst is prioritised between format stream and slave stream depending upon how many 
pending fetches there are. Once a stream has been chosen for a burst, the address dispatcher will 
attempt to send DAC_MEMBURST_SIZE addresses. The burst will terminate prematurely if there are 
no more pending addresses for that stream. 
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DAC FIFO Split 
Implicit Address Ox00A2 
Reset Value 0x0000 


ae = Sn (NSE © 
DAG_FIFO_SPLIT 


DAC_FIFO_ SPLIT (RW) 


The number of FIFO slots to allocate to the slave stream. The FIFO has 127 slots which can be 
partitioned between the format stream and the slave stream depending upon the required bandwidth 
for each stream. This field must equal the value set in DAC_SLAVESLOTS + 1 


DAC HS Front Porch 


Implicit Address 0x00A3 
Reset Value 0x0000 


DAC_HSYNC_FP (RW) 


The length of the Horizontal Sync front porch in Pixel Clocks. 
DAC HS Back Porch 


Implicit Address Ox00A4 
Reset Value 0x0000 


15-11 
DAC_HSYNC_BP 


DAC_HSYNC_BP (RW) 


The length of the Horizontal Sync back porch in Pixel Clocks. 


DAC HS Width 
Implicit Address Ox00A5 
Reset Value 0x0000 
Ranenved 


DAC_HSYNC_WIDTH (RW) 


The width of the Horizontal Sync pulse in Pixel Clocks. 


DAC Sync Control 
Implicit Address 0x00A6 
Reset Value 0x0000 


15-14 ee ee ee ee ee ee ee 


DIG_CK_POL | DIG_PIXFMT | DIG_VS_POL | DIG_HS POL} DIG_SLAVE | DIG_ROM 
| 7 | vce [| cs | cs | ts | 2 ft a fo | 
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DIG_MUX DIG_PORT_EN | DAC_NATIVE DAC_REP |DAC_HS_POL| DAC_HS_EN |oac_vs_PoL| DAC_VS_EN 


DIG_CK_POL (RW) 


Digital clock polarity 


0 Normal clock polarity 
1 Inverted clock polarity 


DIG_PIXFMT (RW) 


Digital pixel format 


0 Data are 24bpp 
1 Data are 18bpp (6 MSBs of R, G and B are used) 


DIG_VS_POL (RW) 


Vertical sync polarity of the digital port. 


0 Vertical Sync polarity is active low 
1 Vertical Sync polarity is active high 


DIG_VS_POL (RW) 


Horizontal sync polarity of the digital port. 


0 Horizontal Sync polarity is active low 
1 Horizontal Sync polarity is active high 


DIG_SLAVE (RW) 


Selects digital port (and DAC) to operate in either sync master or sync slave mode. 


0 DAC operates as sync master (H & V Syncs and Blank are outputs) 
1 DAC operates as sync slave (H & V Syncs are inputs, Blank is ignored) 
DIG_ROM (RW) 
The digital port can use ROM address bits 15:4 when outputting data. 
0 Video data output to ROM address bits 15:4 disabled 
1 Video data output to ROM address bits 15:4 enabled 
DIG_MUX (RW) 
Selects between multiplexed 12-bit and non-multiplexed 24-bit operation. 
0 Video data multiplexed by VQ_CLOCK 
1 Video data non-multiplexed 


DIG_PORT_EN (RW) 


Enables the digital port. 


0 Digital port disabled 
1 Digital port enabled 


DAC_NATIVE (RW) 


0 DAC Feed operates in VGA mode. No display list is required to operate in this mode. 
1 DAC Feed operates in native display list mode 
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DAC_REP (RW) 


0 DAC Feed scaler interpolates data 
1 DAC Feed scaler replicates data 


DAC_HS_POL, DAC_VS_POL (RW) 


0 Horizontal / Vertical Sync is positive polarity 
1 Horizontal / Vertical Sync is negative polarity 


DAC_HS_EN, DAC_VS_EN (RW) 


0 Horizontal / Vertical Sync is on 
1 Horizontal / Vertical Sync is off — the polarity is controlled by DAC_HS_POL / 
DAC_VS_POL 


DAC Cursor X Position 


Implicit Address 0x00A7 
Reset Value 0x0000 


15-13 


Reserved DAC_CURSOR_X 


DAC_CURSOR_X (RW) 


X position of the cursor relative to the start of HSync in pixels. This is a 13bit signed number. 
DAC Cursor Y Position 


Implicit Address 0x00A8 
Reset Value 0x0000 
ee 


DAC_CURSOR_Y (RW) 
Y position of the cursor relative to the start of VSync in lines. This is a 12bit signed number. 
DAC CLUT Write Address 


Implicit Address 0x00B4 
Reset Value 0x0000 


DAG_CLUT_WADDR 


DAC_CLUT_WADDR (RW) 
Sets the address of the LUT entry for writes. 
DAC CLUT Read Address 


Implicit Address 0x00B7 
Reset Value 0x0000 


DAG_CLUT_RADDR 
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DAC_CLUT_RADDR (RW) 


Sets the address of the LUT entry for reads. 


DAC CLUT Data 
Implicit Address Ox00B5 
Reset Value 0x0000 


DAC_CLUT_DATA (RW) 


The LUT is programmed by setting the write address then writing data to the LUT data register. Data is 
written in order Red, Green, Blue. After each triple, the write address is incremented. Data read occurs 
in the same way. The read address and write address registers are actually the same physical register 
so changing one will change the other. 


DAC CLUT PixelMask 


Implicit Address 0x00B6 
Reset Value 0x0000 


This register has no effect and is purely present for VGA compatibility. 


DAC Control 
Implicit Address Ox00A9 
Reset Value 0x0000 


-0 | 9 | 2 | 7 | 6 | ss | 2 | + | o | 
R a 


Reserved 6BIT |COMP_EN} DAC_DIS VREF_EN eserved |COMP_RED| COMP_GREEN | COMP_BLUE / 
/DAC_SETUP | DAC_BLNK 


6BIT (RW) 
0 CLUT is in 8bit mode 
1 CLUT is in 6bit mode (Top 2 bits of CLUT entry are ignored) 


COMP_EN (RW) 


0 Comparator is in power down mode 
1 Comparator is enabled. 


The comparator is used to detect the presence of a monitor. When the comparator is enabled, a 33% 
grey image is generated and COMP_RED, COMP_GREEN and COMP_BLUE can be read to detect a 
load on those signals. 


DAC_DIS (RW) 
0 DAC is enabled 
1 DAC is in power down mode 


VREF_EN (RW) 


0 VREF is in power down mode 
1 VREF is enabled 
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In normal operation PowerVR 250 generates it's own VREF signal. If an external VREF signal is 
generated, then the internal VREF generation must be disabled. 


COMP_RED, COMP_GREEN, COMP_BLUE (RO) 


Reads the comparator for the specified color channel on the DAC. 


DAC_SETUP (WO) 


This is the write only bit shared with COMP_GREEN. It controls pedestal setup on the blanking signal 
produced by the DAC, but is unused and must be set to 0. 


DAC_BLNK (WO) 


This is the write only bit shared with COMP_BLUE. When 0, the DAC always outputs blanking, when 
1, the DAC operates normally. 


In Normal operation, the CLUT would be set to 8bit mode and the comparator, DAC and Vref would all 
be enabled and DAC_BLNK would be set to 1. 


DAC CRC Control 


Implicit Address Ox00AA 
Reset Value 0x0000 


crc_status | crc_start | Rc set 


CRC_SEL (RW) 


Selects color channel on which to perform CRC check. 


00b CRC on red 
01b CRC on green 
10b CRC on blue 


CRC_START (RW) 


0 Resets the CRC Result returned in the CRC Result Register 
1 Starts accumulation of CRC for the chosen channel starting at the next VSync trailing 
edge and finishing at the next VSync leading edge. 


CRC_STATUS (RO) 


This bit is set when the CRC check has completed. 
DAC CRC Result 
Implicit Address 0x00AB 
Reset Value 0x0000 
15-0 


CRC_RESULT 


CRC_RESULT (RO) 


Returns the result of the last CRC performed. 
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DAC Slave Stream Registers 


The DAC Feed’s slave stream describes a simple CRTC type of display surface which is typically used 
to generate pixels for the background graphics, where the format stream generates pixels for video 
overlays etc. 


Slave Stream Width 


Implicit Address Ox00AC 
Reset Value 0x0000 
— 


SLAVE_WIDTH (RW) 


Width of the slave stream display in 64bit words - 1. 
Slave Stream Height 


Implicit Address 0x00AD 
Reset Value 0x0000 
— 


SLAVE_HEIGHT (RW) 
Height of the slave display stream in lines - 1. 
Slave Stream Base Address Low 


Implicit Address Ox00AE 
Reset Value 0x0000 


SLAVE_BASEADDR_LO 


Slave Stream Base Address High 


Implicit Address OxO00AF 
Reset Value 0x0000 


SLAVE_BASE_ADDR_LO, HI (RW) 


Bits 26 — 3 of the slave stream base address. This supports a 128MB address range on a 64bit word 
boundary. 


Slave Stream Stride 


Implicit Address 0x00B0 
Reset Value 0x0000 


15-1 
SLAVE_STRIDE 


SLAVE_STRIDE (RW) 


The stride of the slave stream display in 64bit words. This allows for a stride of up to 16KB. 
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DAC Blend Mode 


Implicit Address 0x00B1 
Reset Value 0x0000 


18 2-0 


LUT_SEL Reserved CK_MASK Reserved | BLEND_MODE 
ma CH_MASK 


BLEND_MODE (RW) 


000b ~=—« format stream only — only the data from the format stream is displayed 
001b color key 
010b chroma key 


CK_MASK (RW) 


In color key mode this is a RGB color key mask. Each bit masks whether that color component is used 
in the Color Keying e.g. setting CK_MASK to 111b matches on any color, and setting it to 001b 
ignores the Blue color component when matching. 


These bits are unused in chroma key mode. 


UNPACK_MODE (RW) 


In color key mode: 


000b 8bpp RGB 
001b 16bpp 565 RGB. 
011b 24bpp 888 RGB 
100b 32bpp x8888 RGB 


CH_MASK (RW) 
In chroma key mode this 2 bit field is the VU mask which acts in the same manner as the CK_MASK 


described above. 


LUT_SEL (RW) 
0 LUT is used for the format stream 
1 LUT is used for the slave stream 


DAC Color Key 1 


Implicit Address 0x00B2 
Reset Value 0x0000 


DAC_CK_GREEN DAC_CK_BLUE 
DAC_U_UPPER DAC_U_LOWER 


11 December 1998 108 Version Draft A 


PowerVR 250 
Programming Guide 


Confidential 


Pte Cd Cid 
DAC_CK_RED 
DAC_V_UPPER DAC_V_LOWER 


DAC Color Key 2 


Implicit Address 0x00B3 
Reset Value 0x0000 


DAC_CK_BLUE, GREEN, RED (RW) 


Color Keys for color key mode. 


DAC_U_UPPER, DAC_U_LOWER (RW) 
DAC_V_UPPER, DAC_V_LOWER (RW) 


Upper and lower comparison values for chroma keying. 


DAC Feed Overlay Example 


Vertical Sync Region (A) 


RGB Graphics (G) 


Table 12. Simple Overlay Example 


In the example above a screen display is created with a single video window which uses color keyed 
overlay. The horizontal sync and blanking regions are described by the DAC_HSYNC_ xxx registers 
and the rest of the display is described by eight spans. Each span is processed by issuing a span 
instruction to the DAC Feed and has an associated display list word in the display list. 


The slave stream registers would be set up to match the display resolution of the format stream, as 
well as describing the RGB graphics color format, the start address of the graphics image and the 
color key used for video overlay. 


Span A describes the Vertical Sync region. The DAC_SOF bit is set in the instruction, the 
DACSPAN_W will be set to match the width of the screen (not including horizontal blanking area) and 
DACSPAN_H will be set for the number of lines of VSync required. DAC_GETBLNK will also be set as 
no active pixels are required. The associated display list word will have the DAC_VS and DAC_HS 
flags set. 


Span B describes the top blanking region. DAC_SOL will be set in the instruction and DACSPAN_W 
and DACSPAN_H again used to describe the region size. DAC_GETBLNK will again be set but the 
associated display list will only set the DAC_HS flag. 


Span C describes a graphics only region of the display where the graphics pixels are fetched as part 
of the slave stream. DACSPAN_W and DACSPAN_H are set as before as is DAC_SOL. 
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DAC_GETBLNK is set because no format stream pixels are required. In the display list word 
DAC_DUAL is set which tells the DAC feed to fetch slave stream pixels. DAC_HS is also set. 


Span D is identical to Span C, except that DACSPAN_W only describes the part of the line up to the 
start of the video window and also only describes a span of height 1. 


Span E describes the video window. Because this is not the start of a line, DAC_SOL and DAC_HS 
are not set. DACSPAN_W describes the number of words fetched from memory; this is used in 
conjunction with DAC_SCALE_FACTOR to produce the desired number of output pixels to fill the 
video window. DAC_CSC is set to convert YUV pixels to RGB and DAC_DUAL remains set, since 
color keying is being used. 


Span F is identical to Span D except for the region described and the fact that DAC_SOL and 
DAC_HS are not set. 


Span G is identical to Span C and Span H is identical to Span B except for the regions described. 


Note that Spans D, E and F are sent for each line of the display covered by the video window. 
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VIDEO IN 


The video in feed provides an 8bit VMI 1.4 video data connection between a video decoder or MPEG 
decoder and PowerVR 250. The resulting data is written to either framebuffer or host by the Primary 
core. This gives the ability to perform video capture and display live video windows as well as 
performing VBI data capture. The VMI 1.4 host data is sent over the GPIO port. 


Region fat : 
Video Indata | |———> Ganture Decimation }———-,_ Primary Core 


Table 13. Video In pipeline 


The video in pipeline is controlled both by instructions written from the Primary Core, and by Implicit 
registers. 


Video In / Primary Core Interface 


There are three primary core instructions which can be sent to the video in pipeline. 


Video In Parameter Word 


63-55 5250 40-22 
VILINTSEL _|VI_GENWORD VI_VTDEC VI_VMAX VI_VMIN 


VI_MIRROR VI_VPOL VI_HPOL VI_1FRAME VI_1FIELD 


22-20 19-10 
VILCAPEVEN VI_CAPODD VI_HZDEC VI_LHMAX VI_HMIN 


VI_INTSEL 


Select Video In interrupt source 


0 VRef 
1 End of Field 
VI_GENWORD 


Setting this bit causes the video in pipeline to generate an extra 64bit word of data at the leading edge 
of VRef. 
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VI_VTDEC 


Programs the vertical decimator. 


000b No decimation 
001b Decimate by x2 
010b Decimate by x4 
011b Decimate by x4 
100b Decimate by x8 
101b Decimate by x16 
110b Decimate by x32 
111b Decimate by x64 


VI_VMAX 
Last line of video to capture. Program in field lines from leading edge of VRef. 


VI_VMIN 


First line of video to capture. Program in field lines from leading edge of VRef. 


VI_MIRROR 


Setting this bit flips the order of video pixels within the 64bit word. It can be used to generate a 
mirrored video in stream. 


VI_LVPOL 
Sets the polarity of the video input VRef signal. Setting this bit programs an active high VRef. 


VI_LHPOL 
Sets the polarity of the video input HRef signal. Setting this bit programs an active high HRef. 


VI_4FRAME 


Setting this bit causes the video in pipeline to capture a single frame. 


VI_41FIELD 


Setting this bit causes the video in pipeline to capture a single field. 


VI_CAPEVEN 
Setting this bit causes the video in pipeline to capture only even fields. 


ViI_CAPODD 


Setting this bit causes the video in pipeline to capture only odd fields. It must not be set in conjunction 
with VI_CAPEVEN. 
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VI_LHZDEC 


Programs the horizontal decimator. 


000b No decimation 
001b Decimate by x2 
010b Decimate by x4 
011b Decimate by x4 
100b Decimate by x8 
101b Decimate by x16 
110b Decimate by x32 
111b Decimate by x64 


YUV 4:2:2 blocks are split up so that the U and V components stay with the Y value they are 
referenced to. The decimator requires that the software select an area of the video field whose 
dimensions are exactly divisible by the decimation factor. 


Note that for RGB 5:5:5 operation horizontal decimation is not supported. 


VI_HMAX 


Last pixel of video to capture. Program in pixels from leading edge of HRef. 


VI_HMIN 


First pixel of video to capture. Program in pixels from leading edge of HRef. 
VBI Parameter Word 


ex-at| aoa | at—29 z-20 | toto | vo 


VBI_VMAX VBI_VMIN VBI_HMAX VBI_HMIN 


VBI_VMAX, VBI_VMIN, VBI_LHMAX, VBI_HMIN 


Describes a separate region for capture of VBI data with the same parameter types as used for video 
capture except that a horizontal region unit relates to 2 VBI data samples i.e. a region of width 10 
captures 20 VBI samples. VBI captured data is never decimated. 


Video In Instruction Word 


VI_FILL_FIFO|VI_LPARAM_SEL| VI_VBILEN | VILQSTOP | VI_LSTOP_ | VI_LSTART 


VI_FILL_FIFO 


Tie FIFO input valid high to fill with data 
VI_LPARAM_SEL 

Parameter word contains: 1 =VBI parameters, 0 = video 
VI_VBI_EN 

Capture VBI data 

VI_LQSTOP 

End capture immediately 


VI_STOP 


End capture at end of field 
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VI_START 


Start pipeline using supplied parameters 
Capture starts at the start of the next video field 


Video In Implicit Registers 
Video In Status 1 


Implicit Address 0x007D 
Reset Value 0x0000 


70 
VILPICOUNTLO 


VI_PIXCOUNTLO (RO) 
Low 8 bits of captured pixel count. 


Video In Status 2 


Implicit Address Ox007E 
Reset Value 0x0000 


15-12 
VI_PKCOUNTH 


VI_PIXCOUNTHI (RO) 
High 12 bits of captured pixel count 
Video In Status 3 


Implicit Address 0x007F 
Reset Value 0x0000 


wer2 | as | tl 
VI_LHREF VI_VREF VI_FIELD VI_CURR_LINE 


VI_HREF (RO) 
Shows status of the input HREF signal. 
VI_VREF (RO) 
Shows status of the input VREF signal. 
VI_FIELD (RO) 


Current field status 


0 Even 
1 Odd 


The polarity of the current field is determined from the position of the vsync edge in relation to the 
hsync pulse. If vsync is coincident with hsync, the field is even polarity. 
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[ VREF i - — ; 
ODD FIELD | | 


VREF [ : : 
EVEN FIELD | 


Table 14. Video In Field Polarity 


If the current field polarity is the same as the last one the video in pipeline inverts the field type 
anyway. When the current field is not the same as the previous one, the calculated value is output. 


VI_CURR_LINE (RO) 
Current line count. 


Error Handling 


In the event that video data is not consumed fast enough by the Primary Core data will inevitably be 
lost. The pipeline will continue running, and allows the output FIFO to overflow. This overrun condition 
can be detected by comparing the number of video pixels read with the number in VI_PIXCOUNT 
which counts pixels whether consumed or not. 
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APPENDIX A. TEXTURE FORMATS. 


This document describes how the PowerVR 250 texture formats are stored in memory, and some 
details of the color formats. Textures can be stored either in host (AGP) memory or local framebuffer 
memory and may be stored in normal or twiddled format. 


Twiddled Textures 


Twiddled textures are a mechanism for storing textures in memory while keeping unnecessary page 
breaks to a minimum. 


The general scheme of accessing any of the Twiddled textures can be split up into two stages. 


Interleaving U and V Co-ordinates 


The U and V indices are interleaved in memory as follows:- 
U3V3U,V2U,V,U,V_ 


The subscript denotes the bit position in U or V which gives an order of texel storage shown below. 


Table 15. Twiddled Textures 


The grid represents the position in the bitmap, and the number represents the location that it is stored 
in memory. In the case of rectangular twiddling, the common least significant bits are twiddled, and the 
left over most significant bits are appended to the top. For example, take a 4 bit U co-ordinate and a 6 
bit V co-ordinate. Twiddle the 4 least significant bits of U and V, then append the 2 most significant bits 
of V onto the top of the result of the twiddling thus:- 


Storing Interleaved Texels 


Once the texels have been interleaved, they are stored in Little Endian format. 
1555, 565, 4444, and Bump Map 


These are all 16 bit data items, so no packing is required. 
YUV422 


The data item used in 422 is a 32 bit word. The texture interface accesses four 16 bit data items. The 
422 data item is broken up as follows :- 
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15-8 7-0 


YO U 


Lets call this YOU. 


15-8 7-0 
Y1 V 


Call this Y1V. 
The luminance of YOU is the left most of the two texels. 


The indexing of these 16 bit data items are then treated like 1555 etc. 


Scan Order Textures 


If the texture is a Stride Texture, then the address calculation is:- 
Index = V*StrideRegister*32 + U 

otherwise the address calculation is:- 
Index = V*VSize +U 

1555, 565, 4444, and Bump Map 


For example, the following diagram shows how a 128x128 16 bit scan order texture would be stored in 
the texture memory. 


wonssavocuctvenseveesseesesaesiebes, 


Loe aa | ar [ = | a Laer | —__e— 


0x80 | 0x81 | 0x82 | 0x83 | | Ox84 | | 0x85 


The numbers in the brackets are the texel coordinates. 
YUV422 


For example, the following diagram shows how a 128x128 YUV422 scan order texture would be 
stored in the texture memory. 


YOU(0,0) | YIV(I,0) | YOU(,0) | YIV@G,0) | YOU(4,0) | YIVG,0) 


YIVG,) 


0x80 | 0x82 | 0x83 | | 0x84 


The numbers in the brackets are the texel coordinates. 
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VIP-Mapped textures 


MIP-Mapped textures are restricted to being square and Twiddled. The 1x1 texture is stored first, 
followed by the 2x2 etc. The Pixel offsets from the “Texture Address” to the texture levels are:- 


Texture 16bit offset 

1x1 0x00003 | 

2x2 0x00004 

4x4 0x00008 

8x8 0x00018 
16x16 0x00058 
32x32 0x00158 
0x00558 
0x01558 
| 256x256 | 0x05558 
512x512 0x15558 

| 1024x1024 | 0x55558 


MIP-Mapped YUV422 textures 


MIP-Mapped YUV textures are stored in the above manner, but there is a special case for the 1x1 
map. The YUV color format requires two texels for the conversion, so a YUV 1x1 texture does not 
make much sense. Therefore a RGB565 texel is placed in the 1x1 map. 


MIP-Mapped 8BPP Palletised textures 


The 8 bit resolution offsets from the “Texture Address” to the texture levels are:- 


Texture 8bit offset 
1x1 0x00003 
2x2 0x00004 
4x4 0x00008 
8x8 0x00018 

16x16 0x00058 

32x32 0x00158 
64x64 0x00558 
128x128 0x01558 
256x256 0x05558 
512x512 0x15558 

| 1024x1024 | _0x55558 | 


The 8 bit resolution offset gets converted to a 16 bit address by dividing by 2, then the 8 bit item is 


picked out using the Little Endian convention. 
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MIP-Mapped 4BPP Palletised textures 


The 4 bit resolution offsets from the “Texture Address” to the texture levels are:- 


Texture Abit offset 
1x1 0x00003 
2x2 0x00004 
4x4 0x00008 
8x8 0x00018 

16x16 0x00058 
32x32 0x00158 


64x64 


0x00558 


128x128 


0x01558 


256x256 


0x05558 


512x512 0x15558 


1024x1024 0x55558 


The 4 bit resolution offset gets converted to a 16 bit address by dividing by 4, then the 4 bit item is 
picked out using the Little Endian convention. 


VQ Textures 


The VQ texture format discussed below assumes the twiddled format is used. 


A VQ texture consists of 256 Code Book entries, followed by a 2D array of 8 bit Indexes. Each Code 
Book entry has 4 texels. These texels are stored in twiddle order. The “Texture Address” points to the 
start of the Code Book. Directly after the Code Book is stored the 2D array of Indexes. These 8 bit 
indexes are twiddles, and then stored in Little Endian (like the 8BPP palletised textures). 


NOTE 1: In PowerVR 250, the Indexes and Code Book are not separated by the “BANK BIT”, they are 
stored contiguously. 


NOTE 2: If the user wishes to have a smaller than 256 entry Code Book, they can put the “Texture 
Address” in the address range of the previous texture, and use only the upper Indexes. 
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MIP-Mapped VQ textures 


In the case of MIP-Mapped VQ textures, all the MIP-map levels share one Code Book. 


The Byte offsets from the end of the Code Book to the texture levels are:- 


Texture [ Byte offset 
1x1 0x00000 
2x2 0x00001 
4x4 0x00002 
8x8 0x00006 

16x16 0x00016 
0x00056 


0x15556 


128x128 
256x256 
512x512 
1024x1024 


Color Expansion 
Inside PowerVR 250, the colors have 8 bits per component. As a texture color is fetched, it is 
expanded up to 8888. 


If the texture type is not ‘Scan Order’, the combinations of color component expansion are described 
below:- 


1 bit to 8 bit. 


The single bit is repeated in all 8 bits thus:- 
CO => ¢€Oo cO CO CO CO CO CO CO 
4 bit to 8 bit. 


The four bits are repeated in the least significant bits thus :- 
CS C2 El Co > C3 C2 Gl CO G3. C2 Gl CO 
5 bit to 8 bit. 


The top three bits are repeated in the least significant bits thus :- 
C4 C3 C2 C1 C0 > G4 C3 €2 Cl CO C4 CS 
6 bit to 8 bit. 


The top two bits are repeated in the least significant bits thus :- 

Gs €4 C3 €2 CL CO > Co C4 C3 €2 Cl CO Cs C4 

If the texture type is ‘Scan Order’, then the color conversion is performed as follows :- 
1 bit to 8 bit. 


The single bit is repeated in all 8 bits thus:- 
co > CO CO CO CO CO CO CO CO 
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4 bit to 8 bit. 


Four zeros are appended in the least significant position :- 


a) 


3 €2 Ci CO > GS C2 Ci CO OO Q 


5 bit to 8 bit. 


Three zeros are appended in the least significant position :- 


oO 


C4 €3 ¢2 Cl CO > c4 €3 C2 C1 co a 0 
6 bit to 8 bit. 


Two zeros are appended in the least significant position :- 
Ce C4 C3 C2 CL ico > ch C4 C3. €2 Ci co O 0 


The reason ‘Scan Order’ textures are treated differently is because of the main use of this texture 
type, as a Backdrop texture. A Backdrop texture is used as the destination of a render, and then used 
as the background in the next render. It is essential a backdrop texel is not modified when it is read in 
and written back to the same position. If a backdrop texel is slightly modified, this error will accumulate 
over a few renders and destroy the starting image. 


The color expansion and the frame buffer dithering can conspire to modify the texel. The default color 
expansion method does set some of the least significant bits. When dithering, if any of the bits that are 
going to be truncated are set, then the color component may have 1 add to it. Therefore, some of the 
color values will be modified if dithering is enabled. 


The solution is to color expand ‘Scan Order’ texels with zeros. The result of this is that the dithering 
will never add 1 to a color component of a backdrop texel. 


YUV422 to 8888 Conversion 


R=Y+(11/8)*V 
G = Y - (0.25*(11/8)*U + 0.5*(11/8)*V) 
B=Y + 1.25*(11/8)*U 

o = 255 


NOTE 1: The U and V values are stored as 8 bit offset binary numbers in Texture Memory. 
NOTE 2: The Y values are stored as 8 bit positive numbers in Texture Memory. 
NOTE 3: The R,G,B, values are clamped to 0 and 255. 


Performance Considerations of Texture Alignment 


The effective page size in PowerVR 250 is 2048 Bytes. A 32x32 16 bit texture can be stored ina 
single page, as long as the “Texture Address’ is aligned to a 2KB boundary. If a 32x32 texture is 
aligned to a 1KB boundary, then there will be a line of page breaks across the centre of the texture. 
This can significantly effect the performance of rendering the texture. Therefore, for optimal 
performance, the space allocated for a texture should be rounded up to the nearest 2KB. For textures 
smaller than 32x32, this is very wasteful. An intelligent texture memory manager could allocate 4 
16x16 textures together, for example. 


The optimal alignment for VQ textures would be 2KB, this is because the Code Book would placed in a 
single page. 


For non-VQ MIP-mapped textures, the optimal alignment occurs when the 32x32 MIP-map level is 
aligned to the 2KB boundary. This alignment happens when the “Texture Address’ is set to the 2KB 
boundary + 1360 Bytes. 
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Immediate Registers 


DAC LING COMPAL ssccecasissnswmacesangeast eon wea 21 
DAC Line Count...........ccccceccceeeeteeeesteeeees 21 
AG REE LO eesti Sadie ieteoniuctantictascagiomsenitonors 18 
DAG: PEI si Sovcnsesnsancveapueicrenner eee eneentons 18 
NC Interface A... ccccceeseeeeeesteetetteeeeenes 47 
NC INSTA CEHB wscvacssiissinauns nassansissaavanaionneracaeses 47 
JLSHHUDE MASK vrccrecmnanvusarenesemamnsnareanenarens eames 20 
INferrpt Status <s.cccsvisiouweerovexemversermmnenmeemnns 20 
Primary Core Reset Vector High .................. 51 
Primary Core Reset Vector Low................... 51 
Primary Core Soft Interrupt...........00....0008 51 
Primary Core Video Clock Div...............0.05 51 
Read ACCESS 1..........:cccccccceesteeecesteeeneeteeeeeaes 7 
Read ACCESS 2..........cccccccccccceeesseceeeeesssssneeeeess 7 
SOMWALES ROSE: scnsscsousenerenaenarwenciosmasnyaes 22 
Thread Enable 0 sscssscsssveeensnssevsesvmnencsaiorons 49 
Thread Enable 1.0.0... eeceececeeseeesteeeeeneee 49 
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TNGAG: ENABIS:2 ccccsrsinctsceveavors samemersmaneasmeusinee 49 
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Immediate Registers 


DAC Line COMpPares. ssicesscassverensrveass 0x0060 
DAC Line Count..........0c cee eeeeeeees 0x0064 
DAG PLO) osicanscntoaniammensinsdaeotancisaagns 0x0080 
DAG Pl, 4 casaseenpeasenssemimaneonananieementaainnanics 0x0084 
WC Interface A............ccccesssnneecesssneesees 0x0050 
HK$ Tal (la f= \el— | = eee eee 0x0054 
Inte mu pt MASK secsccipesceermencnesauaeeeenss 0x0074 
INCSRPUPE-STAIUWS iss ececesortermpersersemierenmnns 0x0070 
Primary Core Reset Vector High ......... 0x0044 
Primary Core Reset Vector Low........... 0x0040 
Primary Core Soft Interrupt................ 0x0048 
Primary Core Video Clock Div............. 0x004C 
r'REAGH ACCESS els is as eee ote oat ace 0x00C8 
miRGACdMACCESSIZ 5 secatermaaties suibedeoreceseetcn 0x00C4 
Software ReSet .............::ccccseseceeeesneees Ox00A4 
Thread Enable: © sscewcecvasonrersscerenatness 0x0000 
Thread Enable 1 .............:cesessseeeeeeees 0x0004 
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THREAE! ERABIS 2 weiss cssdccensssanevesanawwanertndi 0x0008 
Thread Enable 3.................cccccceeeeeeeeees 0x000C 
Thread Kicker 0 o0........ccceceeeececeeeeeeeeeeees Ox00BO 
TAFE ad RICKS 4 sscccsascciomemenissnenmeencssives 0x00B4 
Thread Kicker 2 ................cccccccceeeeeeeees 0x00B8 
Thread Kicker 3 ..........000....cccceeeeeeeeeeeees Ox00BC 
Thread RECOVER Os: csescscscugsmsnesanseaeacones 0x0020 
THREAD RECOVER 1 sscseccccssincnscesessessascenin 0x0024 
Thread Recover 2............cccccccccceeeceeeees 0x0028 
Thread Recover 3................cceceeeeeeeees 0x002C 
VGA Page: Sele Chi csssssvensssisesonavsecveases Ox00A0 
WMI HOSE Obie ss excsieny citcciaci estates canmannmuns 0x00D0 
VMI Host 1.0.0.0... 000. ccccesesesssesssesssseereees 0Ox00D4 
VMI HOSE 2 occcinscessassaniensnnowiunensensonnmueviees 0x00D8 
AN ol eo eee ee er 0x00DC 
WVfite"AGCESSibis nin iss 8, anitasncceecne 0x00CO0 
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DAC Line Compare.............. eee 0x0128 
DAC Line Count.............ccccceceesteeeeees 0x0129 
DAG PEL (Olesmesmacmnanremmncomnenasanntaueens 0x0120 
DAC: PEL Vssserannnecnmedenmecnenmmcnnte 0x0121 
WC Interface A........ccceeeceecceeeeeesteeeeeees 0x0118 
WC Interface B..........eceseecceessteeeeeenes 0x0119 
Interrupt Mask..............cc:ccceccccceeeeessreeee 0x014D 
INTEMUDE Salus ticsccereaserdoacvscnanewerceseaas 0x014C 
Primary Core Reset Vector High ......... 0x0011 
Primary Core Reset Vector Low........... 0x0010 
Primary Core Soft Interrupt.................. 0x0012 
Primary Core Video Clock Div............. 0x0013 
Software RESEt si ccsssscacssesssoacssermcnvseseees 0x0169 
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Thread Enable 0............00...cccccceeeeeeeee 0x0000 
Thread Enable 1.................eeee 0x0001 
Thread Enable 2.............ccccccccssssseseeees 0x0002 
Thread Enable 3................ccccceeeeeeeeeeees 0x0003 
Thread Kicker 0...........ccccccccesesssseeeeeeees 0x0100 
Thread Kicker 1 .........ccccccccescsecseeeeeeeees 0x0101 
Thread Kiker 2ivcsccsssencssccssnsasesnssanteacens 0x0102 
Thread WICKER Shisisssconcoucwssstoscenstinwesensss 0x0103 
Thread Recover 0.................ccccseeeeeeeees 0x0008 
Thread RECOVEF 1..........cccccceceseeeeeeeeees 0x0009 
Thread Recover 2................ccccceeeeeeeeees Ox000A 
Thread RECOVER 3.........cccccccecccccceeeeeeees 0x000B 
VGA Page Select............:cccceseesesteeeees 0x0168 
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3D Address BaSe...........ccccccceeetsneneeeees Ox028' Emulation Register Ox1F .....................0%0214 
AGP" PONCE cemivncanenmersiesamcuemimes ORO DO Emulation Register 0x20 ..............0.. 0x0215 
AGP? PLUS IW es ccnumsmnesessicpennntecnmemene demas 0x028F Emulation Register 0x21 ..................... 00216 
DAC Blend Mode................:::ceeeeeeeees 0x00B1 Emulation Register 0X22 ..0............. 0x0217 
DAG: Burst Comttil xrcccccccerrnnctectinwansians 0x00A1 Emulation Register 0x23 .................05 0x0218 
DAG GLUT, Dalal weserccsresoinemeansemenaneees 0x00B5 Emulation Register 0x24... 0x0219 
DAC CLUT PixelMask .................. 0x00B6 Emulation Register 9... Ox01FE 
DAC CLUT Read Address................... 0x00B EMULALOM Stat cscs scceserneseraersqeueraseene Ox01F5 
DAC CLUT Write Address. ................... Ox00B4 Frame Buffer Offset...........0.....00..05 0x0287 
DAG ‘Colour KEY c scesewscsxanesesceameswnes 0x00B2 GART ISP Param Remap Hii............... 0x028C 
DAC Colour Key 2 ..........cccccceeeeteeeeeees 0x00B3 GART ISP Param Remap Lo............... 0x028B 
DAC COTO le ccscvecersanooreivicvevesroreewesonnns 0x00A9 GART TA Remap Hii........................0005 0Ox028A 
DAG' GRE 'ConttOl crccrsectanvesernsSbexcoxenaee Ox00AA GART TA REMAD! 10 ccesersstcsnssenctoansesecs 0x0289 
DAGCRE RESUIBreccasissscossnewsresscvavseneze Ox00AB GART TSP Param Remap Hi.............. 0x028E 
DAC Cursor X Position ..............:002 0x00A7 GART TSP Param Remap Lo.............. 0x028D 
DAC Cursor Y Position ...............000 0x00A8 INDOXIAGGECSS See ie ioctl essa 0x0104 
DAG FIFO! S Bl Ret sereseeerercereeetneees Ox00A2 IndexaDatavn es <ae eee eas 0x0105 
DAC: FIOW COMM  Ssccssiet screeteoreesseess 0x00A0 PGDBIS Trsswennsssmnewamavesisansateaoers 0x025C 
DAC HS Back Porch...........:::ccceeeeee Ox00A4 PGI ClaSS! COM iivscanscnnancenrsecenvarsiectonneas Ox025A 
DAC HS Front Porch .............ceeeeeeeeeee 0x00A3 PCI Cycle Base Address. .................44. 0x0292 
DAG HS Widtitticdtesseccsoceceeteensnerosonawathws Ox00A5 PGC) DEVIC! Dbssccnscsusecwurencarensssenssocxeys 0x0258 
DAC. SYNG' Control svcssssvaseccanssesscenssasees 0x00A6 PG) Latency Tim ivccsrscccesevasoreessomeseess 0x025D 
Emulation FB Base................c0ceeee Ox01F4 PCIREVISION eves accenrcsossesanotnammavercncs 0x025B 
Emulation Palette 1/0......0...0....e 0x01FD PCI Subsystem Device ID................... Ox025E 
Emulation Palette 3/2... Ox01FC PCI Subsystem Vendor ID ................. 0x025F 
Emulation Palette 5/4...0000.... ee 0x01FB PGI VGNG OM ID is: secccownsxasoamavimenueneausnes 0x0259 
Emulation Palette 7/6............cce OxO1FA ROM Control cvccavcvesscenwccvsarscveenetonenies 0x00B8 
Emulation Palette 9/8 ..........0.. ee Ox01F9 ROM Data FGM s sscacsassecestenevciesamasvnass 0x00BC 
Emulation Palette B/A........... ee Ox01F8 ROM Data LOW............::cccceeseeeetteeeeee 0x00BB 
Emulation Palette D/C ...........0. Ox01F7 RONIOMS CB occ cstinacviedatonncinumnemrennssturcttents 0x0288 
Emulation Palette F/E......... ee Ox01F6 ROM Read Address ............. eee Ox00B9 
Emulation Register Ox0A................... Ox01FF ROM Write Address .............0::eceee OxOOBA 
Emulation Register OxOB. .................. 0x0200 SDRAM / Arbiter Config 0...............0.. 0x0284 
Emulation Register Ox0C................6. 0x0201 SDRAM / Arbiter Config 1................. 0x0285 
Emulation Register Ox0D.................2. 0x0202 SDRAMIGONMNG) Overesescsvennexerscewsrscenne 0x0280 
Emulation Register OxOE ...............26. 0x0203 SDRAM Config 1 ......ssscsasseceovnaccseerceseee 0x0281 
Emulation Register OxOF ................6. 0x0204 SDRAM ‘GONG: 2: essscocsneeceswzanvunswesnetezaes 0x0282 
Emulation Register 0x10...............08 0x0205 SDRAM Power Status ............. ee 0x0293 
Emulation Register 0x11..............28: 0x0206 SDRAM Refresh ..............::cccccseeeeteeeees 0x0283 
Emulation Register 0x12... 0x0207 Slave Stream Base Address High....... OxO00AF 
Emulation Register 0x13...............00. 0x0208 Slave Stream Base Address Low........ Ox00AE 
Emulation Register 0x14... 0x0209 Slave Stream HeigNt.................eeeeee 0x00AD 
Emulation Register 0x15................00. 0x020A Slave Stream Strideé...............eeeeee 0x00BO 
Emulation Register 0x16... 0x020B Slave Stream Width...............::cceeee Ox00AC 
Emulation Register 0x17..............:06 0x020C VidEO I Status) 1) sssescusmerecenenmnawssavian 0x007D 
Emulation Register 0x18................2 0x020D Video In Status 2.2.0.0... eee Ox007E 
Emulation Register 0x19... 0x020E Video! IN StatUS: 3 cee.sseceseescssssyaveccarsverexe Ox007F 
Emulation Register Ox1A................0. 0x020F Writeburster Burst Cost.............0....... 0x028B 
Emulation Register 0x1B ................ 0x0210 Writeburster Burst Length .............0..... 0x028C 
Emulation Register 0x1C......... ee 0x0211 Writeburster Page Break Cost............. 0x0289 
Emulation Register Ox1D...............00 0x0212 Writeburster Page Hit Cost...........0...... 0x028A 
Emulation Register Ox1E ...............22: 0x0213 Writeburster Timeout...................000 0x0288 
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FOG) ClalIMpIMAX.-sscnneawiiecrcanstsoinernuononins 0x00BC Texture Endian Control...............:005 Ox00E8 
Fog Clamp Min ..............:cceeeeeeeeeeetteeees 0x00C0 TSP Cache Hit Count........... ee 0x0194 
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APPENDIX E. DEVICE ERRATA 


| Problem | Solution 


3D FPU Parameter configuration register must 
not be set to OxOF 


Don't use this setting | 
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